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Abstract 


In this paper we show that a symmetric isotropic matrix polynomial in any 
number of symmetric 3 x3 matrices can be expressed as a symmetric isotropic 
matrix polynomial, in which each of the matrix products is formed from at 
most six matrices and has one of a certain number of forms which are explicitly 
given. The significance of these results in the mechanics of isotropic continua is 
indicated. 

1. Introduction 


We consider a body to undergo deformation described in a rectangular 
Cartesian coordinate system x by 


x, = %;(X;,¢), 


where x; and X;, are the coordinates in the system x of a generic particle of the 
body at arbitrary time ¢ and a standard reference time respectively. We assume 
that the stress components ¢,, at the point x; and time ¢ are single-valued func- 
tions of the eeignmaion gradients 0x,/0X, and of the gradients of the velocity 


vy, the acceleration v,,..., the (m—1)th acceleration vy, at that point and 
mie) i.e. the constitutive equation takes the form 
(1) a (n—1) 
Chessy Ox» OU, Sse —— (1.4) 
A) Cir) Ox, 
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It has been shown [/] that if, further, the material is isotropic at the standard 
reference time, the stress components ¢,; must be expressible as single-valued 
functions of the quantities C,, and AN ee 2,...,) defined by 


(1) 


C Xp OX, ay 2 OUp | OVg 
2 ae YS Ga ois : Pq a%, Xp : 
and ba Wyre ao (152) 
(r) (y—1) ay (y—1) CUm 
Apq= SEL tf wee Ama hp : 
where D/Dt denotes the material time derivative; 7.e. the constitutive equation 
takes the form : P 
ti, =b; (C, FU Pte pre yc (1.3) 


where the functional dependence of ¢;; on the arguments may be different from 
that in equation (1.1). 

Now, C,, and A,’ have the property that, if quantities C,,-and Appar 
defined in an Berane manner in any other rectangular Cartesian system % 
moving in an arbitrary manner with respect to x, then Crs and C,, and Aj), 
and A¥’ are the components in the systems ¥ and x respectively of Cartesian 
tensors. 

We now define the symmetric 3 x3 matrices T, C and A, by 


T=|[t;||, EC =||C,;|| and A,=|]Aqj]. (1.4) 


It has been shown [/] that, if we assume the dependence of #;; on the arguments 
in (1.3) to be polynomial, it follows from the isotropy of the material that the 
stress matrix JT’ must be expressible as a symmetric isotropic matrix polynomial 
in the kinematic matrices C and A, (r=1, 2,...,), the coefficients in which 
are polynomial scalar invariants under orthogonal transformations of the matrices 
Chand tay (7 = 452574. Ws 

If only two kinematic matrices occur in the isotropic matrix polynomial 
expression for T’, as would be the case if in (1.1) ¢;; were taken to be a function 
of dx,/0X, and dv;)/dx, only, or of dv}/dx, and dvy’/ax, only, then T may be 
expressed in a closed form [2]. For example, if the argument matrices are A, 
and A,, T may be expressed in the form 


T = yo + yAi+ pode t ps Ai + ye Ao + 5 (Ay Ag + Ay Ai) + 
+ Yg(Ai A + Ap Al) + y, (A, Ad + AZ A)) + yy (AAG + A} A}), 


(1.5) 


where the y’s are polynomial invariants, under orthogonal transformations of 
the matrices A, and A, and may therefore be expressed as polynomials in the 
integrity basis tr A,, tr nh tr Al, trAR tr Al, tr As ir AAS tA A ered 1A, 
tr A? Aj. 

In the present paper, we consider the more general case when the stress 
matrix T is an isotropic matrix polynomial in an arbitrary number of kinematic 
matrices C,A,,..., A, and derive an expression for T as an isotropic matrix 
polynomial of Reed form. The final result is given by Theorem 5 in § 10, by 
substituting C, A,, A,,..., Ay for ap (P==1,9, Ry 

In arriving at Thecvem 5, the following piocodiee is adopted. In § 2, following 
a statement of certain definitions, we derive, essentially from the Hamilton- 
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Cayley theorem, some basic relations which are satisfied by 3 x3 matrices. In 
§ 3, these relations are used to show that any matrix polynomial in R, 3 x3 
matrices can be expressed as a matrix polynomial of lower or equal partial degrees 
in each of the matrices, of extension <R-+41 and of total degree <2 if R=1, 
of total degree <5; if R =2 and of total degree <R + 2, if R>2 (see Theorem 1). 
In § 4, we apply this result to determine the forms of such matrix polynomials 
in the cases when R =1, 2, 3, 4 or 5 and we also investigate the further limita- 
tions which can be made when the matrix polynomial is a symmetric matrix 
polynomial in symmetric matrices. In §§5 to 8, a finite integrity basis for R 
symmetric 3 x3 matrices is determined. Up to this point all the results obtained 
are derived by algebraic operations based ultimately on the Hamilton-Cayley 
theorem. Finally, in §§ 9 and 10, PEANOo’s theorem is employed to limit further 
the number of elements in the integrity basis and the number of terms occurring 
in the closed expression for a symmetric isotropic matrix polynomial in R sym- 
metric 3 <3 matrices. 


2. Some consequences of the Hamilton-Cayley theorem 


We define a matrix product II of a set of R, n xn matrices ap (P =1,2,...,R) by 
Il = apa? ... ax, (2.1) 


wwnetereacin, Of the subectipts ¢,,%5.....7, 1s one of the numbers 1, 2,..., R and 
the indices «1, %),..., %, are positive integers. We can, without loss of generality, 
write II in the form (2.1) with no two adjacent subscripts in the sequence 7,, 
1a, +--+, 1%, equal and throughout this paper we shall always write matrix products 
in this form. Then, a}, af, ..., a¥¢ are called the factors of the product II. These 
factors are said to be of degrees %,a,,...,a, in the matrices a; ,@;,,,..., @;, 
respectively. The number of factors, e, in the matrix product II is called its 
extension. The total degree of II in the R matrices @p is defined as %,+a,+---+«,. 
The sum of the degrees of all the factors involving the matrix @p is called the 
partial degree of II in ap. 

A sum of products of the type II, with scalar* coefficients, is called a matrix 
polynomial in the matrices @p. The total degree of such a polynomial is defined 
as the degree of its matrix product of highest total degree. The partial degree 
of the matrix polynomial in @p is defined as that of its matrix product of highest 
partial degree in @p. The extension of the matrix polynomial is defined as the 
extension of its matrix product of greatest extension. 

The replacement of a matrix polynomial in the matrices @p by an identically 
equal matrix polynomial of lower extension will be called contraction of the 
polynomial. 

If the coefficients in a matrix polynomial are invariant under all orthogonal 
transformations, the matrix polynomial is tsotropic**. 


* Strictly, the term scalar is used here in the sense of a scalar matrix, rather 
than in its usual meaning in tensor analysis. However, if the matrix polynomial is 
isotropic, then the coefficients are also scalar in the latter sense. 

*x We might have chosen to define isotropy in the more limited sense of invariance 
under proper orthogonal transformations. For our present purposes, the differences 
which would then arise are insignificant. 
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In this paper we will make frequent use of a number of identities which are 
satisfied by 3 x3 matrices, whether or not these matrices are symmetric. These 
identities can be derived from the fundamental relation 


abe+bea+cab+bac+acb+cba=(be+cb)tra+ 
+(ca+ac)trb+(ab+ba) tre +a(trbc—trbtre) + 
+ b(trea — tretra) +¢(trab —tratrb) +I(tratrbtre — 
—tratrbe—trbtrea—tretrab+trabe+treba), 


(2.2) 


where I denotes the 3 3 unit matrix, which is satisfied [2] by any three 3 x3 
matrices a, b,c. If in (2.2) we take a=b=e, we obtain the Hamilton-Cayley 
theorem 


a — a tra+4al[(tra)?— tra?] —Ideta =0, (2.3) 
employing the relation 
det a =| (tra)? + 2tra®— 3tratra’). 
The relation (2.2) was derived from first principles in an earlier paper [2], but 


it may also be derived from the Hamilton-Cayley theorem in the manner shown 
in the Appendix. 


Taking ¢ =a in (2.2), we obtain 
aba+@b+4+ba@=a(trab —tratrb) + 


+ 3b {tra?— (tra)?] + (ab + ba) tra+a@trb + (2.4) 
+ I{tra?b — tratrab + 3 trb (tra)? — tra?}}. 


Multiplying equation (2.4) throughout on the right by a@ and on the left 
by a, adding the equations so obtained and employing the relation (2.3) to sub- 
stitute for a®, we have 

abe@’+@ba=abatra+atrab + 


De 
+ a(tra?b — tratrab) — bdeta+Idetatrb. ay) 


Multiplying equation (2.4) throughout on the left and right by a and employing 
(2.3) to substitute for a3, we obtain 
aba’ = 7aba|(tra)?— tra] — (ab+ ba) deta+a*trab 


)? 
Drs 
+ a{det a tr b — 3 [(tra)?— tra?] trab} + Idetatrab. 2) 


The results in equations (2.4), (2.5) and (2.6) have been given by RIVLIN [2]. 
Multiplying equation (2.2) on the left and right by a, we obtain 


wbea+abed+acabat+abaca+acbhbatacha 
=(abca+acbay) tra + (aca®+ @eca) trb+ (@ba+aba’)tre+ 
+ a (trbe —trbtre) +aba(trea—tretra) + (2.7) 
+ aca(trab—tratrb) + a (tratrb tre — tratrbe — 
—trbtrea—tretrab +trabe+treba). 
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Employing the relation (2.3) and relations of the type (2.5) to substitute for 
a, aca’+aca, @ba+aba’, abca+abea and a&cba+acba? in (2.7), we 
obtain the relation 


abaca+acaba=abatrac +acatrab +a(trabac— trabtrac)+ 


2.8 
[be +cb— btre—etrb+I(trb tre —trbe)] deta, 5 


which has been previously obtained [2] by another method. The method employed 
here shows that it follows as a direct consequence of equation (2.2) and hence 
of the Hamilton-Cayley theorem. 


We define the matrix polynomial G (a, b) by 
G(a, b) =a@ trb+ (ab+ ba) tra+a(trab—tratrb) + 


+ $b [tra? — (tra)?] + I {tr a?b — tratrab +4trb[(tra)? —tr a®)}. eed 
From equation (2.4), we have 
aba+a@b+ba=Gi(a,b). (2.10) 
If x denotes any 3 x3 matrix, we have 
wx b? =a(ax b?). (2.11) 


Replacing a by b and b by a in (2.10), and employing the relation so obtained 
to substitute for aa b? in (2.11), we obtain 

ax b? = —alb?ax + bazxb — G(b,az)| Bue) 
= — (ab?a) x — (aba)xb+aG(b ax). ri 


We now replace b by b? in (2.10) and use the relation so obtained to substitute 

for ab?a in (2.12). We also use (2.10) to substitute for aba in (2.12). We then 

obtain 

ax b? = [a b? + b? a? — G(a, b?)| x + [ab + ba? — Gia, b)| xb + 
+aG(b,ax)=@Cb’?x+ eax 4+ a?(bxb) + b(a ax) b — (2.13) 
— G(a, b*?) x — G(a, b) xb +aGi(b,ax). 


We now replace a and b in (2.10) by b and & respectively and use the resulting 
relation to substitute for bab in (2.13). We also replace a and b in (2.10) by 
b and a®x and use the resulting relation to substitute for b(a?x)b in (2.13). 
We thus obtain 


ez b?=—abx + baa — a (bx + x b? — G(b, x)| — [b?a? x + 


2.14 
+ @a b?—G(b, a x)| —G(a, b?) x—Gi(a,b)xb+aG(b,az). eae) 
Equation (2.14) may be re-written as 
2 b? = a G(b, x) — G(a,b) xb + 
30° x a’? G(b, x) (a, b) x (2.45) 


+aG(b,ax) — G(a, b?)x+ G(b, a’ x). 
Now, let a, b, c, x and y by 3 x3 matrices. We have 
a’ a b?y c? = (ax b) (bye?). (2.16) 
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Replacing @ and 6 in (2.10) by a and xb and by ¢€ and by and using the relations 
so obtained to substitute for a2ab and bye? in (2.16), we have 
veeb?yc?=laxba+aeba— G(a,xab)|[?by +cbyc— Gee, by)| 
—ax(bac?b)y +ax(bacb)yce+ax(bae’b)y + 
+a(bacb)yc— (axba+x ba’) Ge, by) — 
— G(a,xb) (by +cbyc) + Gla, xb) Giec, by). 
Employing relations of the type (2.10) to substitute for bac*b, bacb, ba’c*b, 
bach, axba+aba and c?by +ebye, we obtain 
@x by c= —ax[b?ac?+ac?b?— G(b,ac’)|y — 
—ax|[b?ac+acb?— G(b,ac)|ye — 
— «[ba?c?+ a cb? — G(b, a®c?) | y — 
—a[b?a?c + a’c b?— G(b,a’e)| ye + 
+ |@ab — Gia, xb)| G(e, by) + 
+ G(a,xb) [by c?— G(e, by)| + G(a, xb) G(e, by) 
=— (axrbv®a+2b'a’) (Py +ecye) — 
—(axa+ xa?) (ce? b?y + cb? yc) +a’x b Ge, by) + 
+ G(a,xb) byc?+axG(b,ac)yce+ax G(b, ac?) y + 
+2G(b,a*c)yc+2G(b,a’c*) y — G(a, xb) Ge, by). 


(2.17) 


Again employing relations of the type (2.10) to substitute for axb?a+ab?a?, 
cy +cye, axa+xa and c?b?y +cb?ye in (2.17), we obtain 


wx by c? = — |aa b?— Gia, xb*)| (ye? — G(e,y)| — 

— [@ax — Ga, x)| [b?y ce? — Ge, b?y)| + 
+a@a2bGi(e, by) + Gia, xb) byc? + (2.18) 
+axG(b,ac)yc+ax G(b, ae?) y + 
+aG(b,a@c)yc+x2G(b, ac?) y — Gia, xb) Ge, by). 


Whence, 

3 xe bPye=—elxb?Gi(cy) + G(a,x2) Peye+ w@xb Ge, by) + 

+ G(a,ab) bye +axG(b,ac) ye+ @a G(e, b?y) + 
+G(a,¢4b*)ye+axG(b,ace’)y+xG(b,@ec)ye+ (2.19) 
+ at G(b, ac?) y — G(a, xb*) Ge, y) — 

— G(a, x) G(c, b?y) — Gia, xb) G(e, by). 


We note that a different expression for 3a2ab2yc?, when «J, is obtained 
from (2.15) by multiplying the equation throughout on the right by ye’. 
Taking #=y—=T in (2.19) and noting, from (2.9), that 


GED) Sew Gana ta, 
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we obtain 
3a° bc? = — a®b Gc, b) — Ga, b) bc? + 2@G(e, b?) + 
+ 2G (a, b?) c? —aG(b,ac) ¢ —aG(b, ac?) — (2.20) 
— G(b, ac) ec — G(b, a’c?) + G(a, b) G(e, b). 


We now consider the product ab?a?y, where a, b and y are 3 x3 matrices. 


We | 
Say abaty =ab*(a’y). (2.21) 


Replacing b in (2.10) by y and using the resulting relation to substitute for a?y 
in (2.21), we obtain 


ab?a’y =—ab?|—aya—ya?+ Gia,y)| 
Paap 
= —a(b®ay)a—a(bya’) + ab’? G(a,y). eee 


Again, using a relation of the type (2.10) to substitute for a(b?ay)a in (2.22), 
we obtain 

ab’a’y =a? (b?ay) + (b? ay) a — G(a, b?ay) — a(b? ya’) +ab?G(a,y), 
Ves 
ab?a’y — a’? bay = b* (ay) a —a(b?y) a — Gia, b?ay) +ab?G(a,y). (2.23) 
Now, in (2.15) we replace a, x and b by Db, ay and a respectively and use the 
relation so obtained to substitute for b?(ay)a? in (2.23). Also, in (2.15) we 
replace a, x and b by b, y and a respectively, and use the relation so obtained 
to substitute for b?ya? in (2.23). We thus obtain 
abay —@b?ay = 4|b?G(a,ay) — G(b,a)aya+ 

+bG(a, bay) — G(b,a*)ay+aG(b,a) ya — 


(2.24) 
—abG(a, by) +aG(b, a’) y —a Ga, b?y) — 
— 2G (a, b?ay) + 2ab? G(a,y)|. 
With the notation 
Ha, b) =abatra+a@trab+ 
Dy Ja 
+ a(tra?b — tratrab) — bdeta+JIdetatrb, Cee) 
equation (2.5) may be re-written as 
aba4+ a@ba= Ha, b). (2.26) 


Replacing b in (2.26) by 6?, and multiplying the relation so obtained on the 
seme eve abba’y +ab?ay = Ha, b’) y. (2.27) 
From equations (2.24) and (2.27), we obtain 

2ab2a?y = H(a, b?) y + 4[b? G(a, ay) — G(b,a)aya-- 
+bG(a, bay) — G(b,a*)ay +aG(b,a) ya — 
—abG(a, by) +aG(b, a’) y —aG(a, b?y) — 
— 2G (a, Bay) + 2ab?G(a,y)| 
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ae 20 bay = H(a, b?) y — 4(b?G(a,ay) — G(b,a)aya + 


+bG(a, bay) — G(b,@)ay + aG(b,a)ya — 
—abG(a, by) + aG(b, a’) y — aG(a, b’y) — 
— 2G (a, b?ay) + 2ab? Ga, y)]. 


(2.29) 


Analogous expressions may be derived in a somewhat similar manner for the 
matrix products yab?a? and ya?b?a. Alternatively, such expressions may be 
derived from equations (2.28) and (2.29) by equating the transposes of both 
sides of the equations. We then make use of the fact that the transpose of a 
product of matrices is equal to the product of the transposes of these matrices 
with the order of the factors reversed. Finally, replacing the transposes of the 
matrices a,b and y by a, b and y respectively, the desired expressions for 
ya? b?a and yab?a? are obtained. 

Provided that #+J and y=+I, equations (2.15), (2.19), (2.20), (2.28) and 
(2.29) express the matrix products a’xb?, a?ab*yc?, a®b’c?, aba’y and 
a? b2ay respectively as matrix polynomials of lower total degree. 


3. The contraction of matrix polynomials in R, 3 X 3 matrices 
The contraction of matrix polynomials in Rk, 2 x2 matrices and in two, 3 x3 
matrices has been treated by Rrviin [2]. In this section, we discuss the con- 
traction of matrix polynomials in R, 3 x3 matrices ap (P=1, 2,..., R). The 
particular cases when R =3, 4 and 5 will be discussed in greater detail in § 4. 


Any matrix polynomial P in Rk, 3 x3 matrices is the sum of matrix products 
of the type II, given by equation (2.1), with scalar coefficients. If any of the 
indices %, %),..., %, occurring in the product II is greater than two, then we can 
use the Hamilton-Cayley theorem to express II as a matrix polynomial in which 
none of the matrix products has a factor of degree greater than two. In this 
way we readily obtain 


Lemma 1. Any matrix product II of the R, 3X3 matrices ap (P =1, 2,..., R) 
can be expressed as a matrix polynomial, with extension equal to that of II, in which 
each of the matrix products is either the unit matrix I or is formed from some or 
all of the factors ap, a> (P =1, 2,...,R) and has partial degree in ap less than 
or equal to that of II. 

Each matrix product in the matrix polynomial, which is not J, therefore takes 
the form (2.1) with each of the «’s either 1 or 2. If any factor in such a matrix 
product occurs twice, then a relation of the type (2.4) can be used to express 
the matrix product as a matrix polynomial of lower extension. Let us consider 
a matrix product in the matrix polynomial of the form 


ar atean : 
Ya LARZ OF YAxRAgzZ, (3.1) 


where y and @ are either matrix products or the unit matrix J and & is a matrix 
product. Then, replacing @ and b in (2.4) by a? and x respectively or by ax 
and # respectively, we can express either of the products (3.1) as a matrix poly- 
nomial of lower extension. Using Lemma 1, we see that this may in turn be 
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expressed as a matrix polynomial in which each matrix product is formed from 


some or all of the factors ap and a’. By repeated application of this procedure 
we obtain 


Lemma 2. Any matrix polynomial P in the R, 3 x3 matrices ap may be expressed 
as a matrix polynonual Q of lower or equal extension and total degree, each matrix 
product in which has partial degrees in each of the matrices ap lower than or equal 
to those of P, is either I or is formed from some or all of the factors ap, a? (P =, 
2,..., R) and does not contain two identical factors. 


Suppose a matrix product in the polynomial Q has the form 
yarweare —~(K+ 1), (3.2) 


where yxz is a matrix product formed from some or all of the factors ap (P =1, 
Zeek) and. ay (O41, 2,.2..,2% O=-K, L),.no factoroccurring twicein the 
product, and «+ J. Replacing a and b in (2.15) by ag and a, and using the 
relation so obtained to substitute for aj; aaj in (3.2), we see that the product 
(3.2) can be expressed as a matrix polynomial of lower total degree in the matrices 
a@p than (3.2). Each matrix product in this polynomial is of lower or equal partial 
degree in each of the matrices ap than (3.2). By repeated application of this 
procedure and employing lemma 2 we obtain 


Lemma 3. Any matrix polynomial P in R, 3 x3 matrices Ap can be expressed 
asamatrix polynonual R of lower or equal total degree, in which each matrix product 
ts of lower or equal partial degree in each of the matrices ap than P, ts either I or 
is formed from some or all of the factors ap (P =1, 2,..., R) and at most two of 
the factors ay (Q =1, 2,..., R), no matrix product contains two identical factors 
and any matrix product containing two factors a% contains them consecutively. 


We obtain immediately the 


Corollary. Any matrix polynomial P in R, 3 x3 matrices Ap can be expressed 
as a matrix polynomial of total degree <R+4 and of extension <R-+2. 

Asa result of Lemma 3, we see that any matrix polynomial P in the R matrices 
@p» can be expressed as a matrix polynomial R, in which each matrix product 
has one or other of the forms 


yaxa;e (K+L), (3-3) 
or 

y ay Z (3.4) 
or 

Ye, (3.5) 
where y and @ are either J or matrix products formed from some or all of the fac- 
tors @p (P =1, 2,..., R) such that no two factors in yz are the same. 


We consider first a product of the form (3.3). Four possibilities arise which 
may be considered separately : 
(i) ax is a factor of yz, but a; is not a factor of yz, 
(ii) a, is a factor of yz, but ax is not a factor of yz, 
(iii) ax and a, are both factors of yz, 
(iv) neither ax nor ay is a factor of yz. 
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In case (7) we may write the matrix product (3.3) in one of the forms 


Y Ay V Ax A; 2, (3.6) 
a YAK A, UV AK z, (3.7) 
is Y Ai Ay AK Z, (3.8) 


where y, v and 2 are matrix products formed from some or all of the factors ap 
(P =1, 2,...,R; P+, L) such that, in (3.6) and (3.7), no two factors in yv2 
are the same and, in (3.8), no two factors in yz are the same. As special cases, 
y or 2 or both may be the unit matrix J, but y+. If the matrix product has 
the form (3.6), then taking a=a, and b =v in equation (2.5), and multiplying 
the resulting equation throughout on the left and right by y and azz respectively, 
pect ya,varaiz=—yaxvaxa,z+P, 

where PB is a matrix polynomial in which each of the matrix products is of 
lower total degree than (3.6) and of partial degree in each ap (P=1, 2,..., R) 
less than or equal to that of (3.6). Taking a=ax, r=vax, b=a, in (2.15), 
we can express ya;va,a;z2 and hence the matrix product (3.6) as a matrix 
polynomial in which each of the matrix products is of lower total degree than 
(3.6) and of partial degree in each ap (P =1, 2, ..., R) less than or equal to that 
of (3.6). By an analogous procedure, taking a =a, and b=ajv in (2.5) and 
a=a,, £=v, b=ax in (2.15), we see that (3.7) may be expressed as a matrix 
polynomial in which each of the matrix products is of lower total degree than 
(3.7) and of partial degree in each @p (P =1, 2, ..., R) less than or equal to that 
of (3.7). If the matrix product has the form (3.8), we take a=a,x, b =a, and 
y =z in (2.29) and use the resulting relation to express (3.8) as a matrix poly- 
nomial in which each of the matrix products is of lower total degree and of lower 
or equal partial degree in each ap (P =1, 2,..., R) than (3.8), provided that 2+ J. 
When z =IJ, (3.8) takes the form yaxajzax, which may be expressed as a matrix 
polynomial in which each of the terms is of lower total degree and lower or equal 
partial degree in each ap (P=1,2,..., R), by means of a relation analogous to 
(2.28), unless y=J, in which case (3.8) becomes aj ajax. Thus, any matrix 
product of the form (3.3), which satisfies the conditions of case (7), except 
aay a,, can be expressed as a matrix polynomial in which each of the matrix 
products is of lower total degree and of lower or equal partial degree in each 
onithecmatrices dp (2? ==472) 05k) 


In case (77) we may write the matrix product (3.3) in one of the forms 


yakal va, 2, 6.9) 
or aa 
Ya, Vax a, @, (3.10) 
oe 2 a2 
Ya, aK a, @, (3.14) 


where in (3.9) and (3.10) y,v and 2 have the same meanings as in (3.6) and 
(3.7), and in (3.11) y and 2 have the same meanings as in (3.8). In a manner 
analogous to that employed in discussing the matrix products (3.6), (3.7) and 
(3.8), we can show that each of the matrix products (3.9), (3.10) and (3:41); 
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with the exception of a,aaz, can be expressed as a matrix polynomial in which 
each of the matrix products is of lower total degree and of lower or equal partial 
mesree in each of the matrices:ap4P = 152, :. ., Rk). We note also, from (2.26), 
that Ca Lo a 2 2 2 
Wk Ay Ax = — Ax a, Ak + H (ax, a7), 

where H(qa, b) is defined as in (2.25). H(a, b) is, of course, a matrix polynomial 
in which each of the terms is of lower total degree and of lower or equal partial 
degree in a and b than aba? and a®ba. 


In case (771), we may write the matrix product (3.3) in one of the forms 


Y Ag V Ax A, UA, Z, (3:42) 
or Bart 

Ya, Vax a, Uae, (3.13) 
or Me 

Ya, VaR A, Age, (3.14) 
or 

Y A, A, A; U Ag Z, (3.15) 
or 

Y A, Ay A) Ag Z. (3.16) 


In these matrix products y, v, u, are matrix products formed from some or 
alljof the facters ap (P =14, 2,...., R; P=-K, L) and y and 2 may be the unit 
matrix I, while u+-J and v+TI. In (3.12) and (3.13) no two factors of youz 
are the same; in (3.14) and (3.15) no two factors of yvz are the same and in 
(3.16) no two factors of yz are the same. Each of the matrix products (3.12) 
to (3.16) may be expressed as a matrix polynomial in which each matrix product 
is of lower total degree and of lower or equal partial degree in each of the matrices 
a \P==1,2,...,). For-eaeh of the matnx products (3.12) to (3.45), this can 
be shown by employing equations (2.5) and (2.15) in a manner similar to that 
adopted in discussing the matrix products (3.6), (3.7), (3-9) and (3.10). Thus, 
for the matrix product (3.12) we take a=a,, b =v in equation (2.5) anda=a,, 
x—va,, b=a, in equation (2.15). For (3:13}-and (3.14), we take a=a,, 
b =va?, in (2.5) anda=a,, #=v, b=ax in (2.15). For (3.15) we take a=ax, 
b =ajv in (2.5) anda=a,, x =v, b =ax in (2.15). We now consider the matrix 
product (3.16). We take a=ax and b =a7j in (2.5) and multiply the resulting 
equation throughout on the left and right by ya, and @ respectively, to obtain 


9 
Y Gy 03. A} Og 2 = — Ya, ag a az + P, (3.47) 


where P, is a matrix polynomial in which each of the matrix products is of 
lower total degree than (3.16) and of partial degree in each ap (P =1, 2,..., R) 
less than or equal to that of (3.16). Again taking a=a, and b=ax, in (2.5) 
and multiplying the equation so obtained throughout on the left and right by 
y and azz respectively, we obtain 


2 2 
YG, Ag A Ake = — YO, Ag a, az + B (3.48) 


where P, is a matrix polynomial in which each of the matrix products is of 
lower total degree than (3.16) and of partial degree in each ap (P =1, 2, ..., R) 
less than or equal to that of (3.16). Combining (3.17) and (3.18), we have 


2 2 2 ; 
Y A, A; 07 Ag % = YA) Ag A, az + P,— PB. (3.19) 
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Taking a=a,, #©=a,xa,, b=ax in (2.15), we see that the matrix product 
(3.16) can be expressed as a matrix polynomial in which each of the matrix 
products is of lower total degree than (3.16) and of partial degree in each @p 
(P =1, 2,..., R) less than or equal to that of (3.16). 

We now consider a matrix product of the form (3.4). This may be written 
in one or other of the forms 


YAK VAK Zz, (3.20) 
or Y Ap VAxz, (3.21) 
or yake, (3.22) 


where y, © and 2 are matrix products formed from some or all of the factors 
a@p (P=1, 2,...,R; P+ K) such that, in (3.20) and (3.21), no two factors of 
yvz are the same and, in (3.22), no two factors of yz are the same. As special 
cases, y and 2 may be the unit matrix J, but v+J. We note that by taking 
a—a,x and b=v in (2.5) and multiplying the resulting equation throughout on 
the left and right by y and 2 respectively, we obtain 


2 j 2 
YA Vane = — YA vaxrz+P, 


where P, is a matrix polynomial, in which each of the matrix products is of 
lower total degree than (3.20) and of partial degree in each ap (P =1, 2,..., R) 
less than or equal to that of (3.20). 

Combining the foregoing results, we readily obtain 


Theorem 1. Any matrix polynomial P in R, 3 x3 matrices Ap (P =1, 2, ..., R) 
can be expressed as a matrix polynomial of lower or equal extension and total degree, 
in which 


(t) each matrix product is either I or 1s formed from some or all of the factors ap 
(P:=1; 255.3, R) and at most two of the factors: at(P =1,2)-85 ies 
(11) no two factors in a single matrix product are the same; 


(111) each matrix product 1s of lower or equal partial degree in each of the matrices 
@p (P=1,2,..., R) than the matrix polynomial P; 


(iv) matrix products containing two of the factors ap (P =1, 2,..., R) contain 
them consecutively ; 


(v) no matrix product containing both of the factors a%, and a? contains either 
of the factors Ax and a,, unless it has the form axa? a>; 

(vt) Ag precedes ak in any matrix product containing both ax and aj, as factors. 

From the conditions (i), (ii) and (v) in Theorem 1, we obtain immediately the 

Corollary 1. Any matrix polynomial in R, 3 x3 matrices ap can be expressed 
as a matrix polynomial of extension <R-+1 and of total degree <2 if R=1, of 
total degree <5 if R=2 and of total degree <R+2 if R>2. 

If the matrix polynomial P is symmetric, then 

P>=3(P +P’), 


where P’ is the transpose of P. Then if P contains the term a,aza with a 
scalar coefficient, this term may be replaced by the term 


¥ (Ag a] ak + a ah, ax) (3.23) 
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with a scalar coefficient. Taking a=ax, b =aj, in (2.5), we see that (3.23) can 
be expressed as a matrix polynomial of total degree 4 and extension 3. Combining 
this result with Theorem 1, we have 


Corollary 2. Any symmetric matrix polynomial in R, 3X3 symmetric matrices 
a» can be expressed as a symmetric matrix polynomial of extension <R-+1 and 
of total degree <R+2. 


4. The contraction of matrix polynomials in five or fewer 3 x 3 matrices 


Matrix polynomials in two 3 x 3 matrices. We consider a matrix polynomial 
P in the two 3 x3 matrices a and b. It follows from Corollary 1 to Theorem 1, 
that P can be expressed as a matrix polynomial of total degree <5 and extension 
<3. From Theorem 1, we see that the matrix products in this polynomial may 
be formed from the factors a, b, a?, b? subject to certain limitations. We list 
below all the products of total degree <5 which can be formed from the factors 
a, b, a®, b?, subject to the limitations imposed by Theorem 1, according to their 
total degrees: 


total degree 0 I, 

total degree 1 i, (0), 

total degree 2 a’, b?, ab, ba, (4.1) 
total degree 3 a’b, b?a, ab?, ba’, 

total degree 4 a’? b?, b?a?, aba’, bab?, 

total degree 5 ab?a*, ba’ b’. 


We now consider that a and b are symmetric 3 x3 matrices and that P is 
a symmetric matrix polynomial in these. Then 


P=43(P+P), pan 


where P’ is the transpose of P. Now the transpose of any matrix product of 
symmetric matrices may be obtained by reversing the order of the factors in 
the product. Thus, the transpose of ab is ba, the transpose of ab is ba? and 
so on. We thus see that if P is a symmetric matrix polynomial in the two sym- 
metric matrices a and b, it may be expressed as the sum of the following terms 
with scalar coefficients 
Ls 
a, b, 
a?, b?, ab+ ba, 
eb+ba, ab?+ba, 
a@b?+ ba, aba+aba, bab*?+b'ab, 
abkak+@b?a, bab? + b2a’b. 

Now, from (2.5) and (2.4) and the relations obtained from these by replacing 


a and b by b and a, by a and B? and by b and @”, together with the Hamilton- 
Cayley theorem, we can express aba’?--a?ba, bab?+ bab, ab?a@’+aba 
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and ba2b?-+-b2a2b as matrix polynomials consisting of the terms J, a, b, a*, b?, 
ab-+-ba, @b+ba?, ab?+6?a and a’?b?+- b2a2 with scalar coefficients. We 
thus have the result, previously obtained in [2], that any symmetric matrix 
polynomial in the two symmetric, 3 x3 matrices a and b, may be expressed as 
the sum of the following terms with scalar coefficients: 


: ve, b ab+ba, (4.4) 
web+ba?, ab?+ ba, 
a’? b? + b2a?. 


Matrix polynomials in three, 3 xX 3 matrices. If P is a matrix polynomial in 
three, 3 <3 matrices a, b, ¢, it follows from Corollary 1, to Theorem 1 that it 
can be expressed as a matrix polynomial of total degree <5 and extension <4. 
From Theorem 14, it is seen that the matrix products in this polynomial which 
involve only two of the three matrices a, b, e may be limited to those given in 
(4.1) together with those obtained from (4.1) by permutations of a, b and e. 
The matrix products which involve all three matrices may be taken as those 
listed below, in order of their total degrees, together with those formed from 
these by permutations of a, b, ec: 


total degree 3 abe, 
total degree 4 ebc, bate, bea, (4.5) 


total degree'5. sa*b%e,/ ca?b*) vabecer abheaes bacw: 


By an argument similar to that employed in discussing symmetric matrix 
polynomials in two symmetric matrices, we see that if a, b and € are symmetric 
3 x3 matrices and P is a symmetric matrix polynomial in these, then it can be 
expressed as a matrix polynomial consisting of the terms 


abe+eba, 
wbe+eba, b&e+cab, bea+acb, 


ebe+cha, c@b+bvac, abae+caba, 8) 


abca+a@cba, baca+aecab, 


and the terms formed from (4.4) and (4.6) by permutations of a, b, e, each of 
the terms having a scalar coefficient. In (4.4), the terms b, b? and ab2- b2a 
can be formed from a, a? and a?b + ba? respectively, by a permutation of a, b,c. 
In (4.6), the terms bea?+a2cb and ca?b?2+ b2a2e¢ can be formed from a2be aL 
cba’ and a’b’c + eb? a? respectively by permutations of a, b, ¢. Also, replacing 
a and b by a and € respectively in equation (2.5), we can express baca?+a?cab 
as the sum of — (ba?ca + aca?b) and a matrix polynomial of lower total degree 
than 5. We note also that ba?ca+aca?b can be formed from aba2e -ca2ba 
by a permutation of a, b, e. 
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We thus see that any symmetric matrix polynomial in the three symmetric 
matrices a, b, c may be expressed as the sum of the following terms and terms 
formed from these by permutations of a, b, ¢, with scalar coefficients: 


I, a, @, ab+ba, @b+ba@, a@b?-+ ba, 
abce+cba, 
@’boe+cbad, bae+ecab, (4.7) 
ab?e+cba’, 


ab@c+ea@ba, abcad’+acba. 


Matrix polynomials in four, 3 x 3 matrices. We now consider a matrix 
polynomial P in the four 3 x3 matrices a, b, e,d. From the corollary to Theo- 
rem 1, we see that P can be expressed as a matrix polynomial of total degree 
<6 and extension <5. From Theorem 1, it is seen that the matrix products 
in this polynomial which involve three or fewer of the matrices a, b, c, d may 
be taken as those given by (4.1) and (4.5) and the matrix products formed from 
these by permutations of a, b,c,d. Also, the matrix products which involve 
all four matrices may be taken as those listed below, in order of their total degrees, 
together with those formed from these by permutations of a, b, e, d: 


total degree 4 abed, 
total degree 5 abed, abted, abe?d, abcd’, 
total degree 6 ab?cd, abte’?d, abe?d?, 


4.8 
abcda’*, bacdad, beada’, a 


abead, baca‘d, 
aba@ecd. 
By an argument similar to that employed in discussing symmetric matrix 
polynomials in two and three symmetric matrices, we see that if a, b,e and d 
are four symmetric 3 x3 matrices and P is a symmetric matrix polynomial, 


then it can be expressed as a matrix polynomial consisting of the terms (4.7), 


the terms 
abed+deba, 


wbced+debad, ab’cd+deb’a, abed+deba, abcd@’+dcba, 
e’beed+dcebid, abbed+de’ba, aberd4+dec’ba, 
abed@+adcba, bacd@’+@dcab, beadad’+adacb, 4-2) 
abc@d+dadcba, baca’d+da@cab, 
abacd+dea*ba, 


and the terms formed from (4.7) and (4.9) by permutations of a, b, ¢,d, each 
of the terms having a scalar coefficient. 

In (4.9), the terms abcd?+d’cba, abe’d +dc’ba and abe’d’+d’c’?ba 
can be obtained from the terms a2bed +dcba?, ab?cd + dcb2a and a?b?ced + 
dcb?a? respectively by permutations of a, b, c,d. Also, by employing relations 
obtained from (2.5) by appropriate substitutions for a and b, we can obtain 
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equations of the form 
abcadd +dacba=— (@bcad+dacb@) +Q,, 


abacd+deaba= — (@bacd +dcaba’) + Q,, 


where Q, and Q, are matrix polynomials of total degree <6. We note that 
«wbcad +dacba and a?bacd +deaba’ can be obtained from bacda+ 
adeab and beada?+a2dacb respectively by permutations of a, b, ¢, d. 
We thus see that any symmetric matrix polynomial in the four symmetric 
3X3 matrices a, b, c, d may be expressed as a matrix polynomial consisting of 


the terms (4.7), the terms abced+deba, 


@e@bced+deba, abécd+dcecba, 
e@btcdt-deb'ad, abbeed+de’?b@a, (4.10) 
abeda+a@dcba, bacdadiaddcab, beada+adacb, 
bace@d+dacab, 
and the terms formed from (4.7) and (4.10) by permutations of a, b, c, d, each 
of these terms having a scalar coefficient. 


Matrix polynomials in five, 3 x 3 matrices. Finally, we assume that P isa 
matrix polynomial in the five, 3x3 matrices a, b,c,d,e. As in the previous 
case, we see that P can be expressed as a matrix polynomial of total degree <7 
and extension <6, in which the matrix products are those given by (4.1), (4.5) 
and (4.8) and by (4.11) below, together with the matrix products formed from 
these by permutations of a, b, ¢, d, e. 


total degree 5 abcde, 
total degree6 abcde, ab?cde, abc?de, abcd@e, abcde’, 
total degree7 a@’?b?ede, abbe?de, abe?d@e, abcde’, 
abcdead, abcda@e, abcaede, aba@cde, (4.11) 
bacded@, bacda@e, bacadde, 
beaded, beadae, 
bedaead. 

We now assume that P is a symmetric matrix polynomial in five symmetric 
3x3 matrices a, b, ¢,d,e. Then, in a manner similar to that employed in dis- 
cussing symmetric matrix polynomials in four 3 x3 matrices, we can show that 
P may be expressed as a matrix polynomial, the terms in which are those listed 
in (4.7) and (4.10), the terms 

abcede+edcba, 


wbede+edcebad, abicde+edcb2a, ab@de+edeba, 
web>cde-edevea, abkede+edebea, (4.12) 
abcde@|@edecba, abcd@e+eadeba, abc@de+edacba, 


aba@cde+edcaba, bacd@e+eadcab, bac@de+edacab, 


and the terms formed from (4.7), (4.10) and (4.12) by permutations of a, b, ¢, 
d,e, each of these terms having a scalar coefficient. 
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5. Scalar invariants under the orthogonal group of Rsymmetric 3 x 3 matrices 


In this and the following four sections, we shall discuss the determination 
of integrity bases for absolute invariants of R symmetric 3 x3 matrices under 
the group of orthogonal transformations. It will be seen that both the argument 
and results are independent of whether it is the full or proper orthogonal group 
that is considered. Since we shall be concerned only with absolute invariants, 
we shall, for brevity, generally omit the qualification ‘‘absolute’’. 

Let us define the elements of R, 3 x3 (not necessarily symmetric) matrices 
reid 2a. Ie) Uys 


ap = ||ar;'||- (5.1) 
Any matrix product II of the R matrices ap may be expressed in the form 
II = a), [I*, (5.2) 


where II* is itself a matrix product of the matrices ap of total degree one less than 
that of II. 


Now, from Theorem 1, the matrix product II* may be expressed as a matrix 
polynomial of lower or equal partial degrees in each of the matrices and of lower 
or equal extension, in which each matrix product takes one of the forms 


dnd ao (ee Dy) 
yapaxz (K+L), (5.3) 
varw, 
or 
u 


? 


where y and 2 are matrix products formed from some or all of the factors ap 
(P =1, 2,...,R; P+K, L), v, w and u are matrix products formed from some 
or all of the factors @p (P =1, 2,..., R), no two factors in yz, in vw or in u 
being the same. y, 2, v, w and wu may, as particular cases, be the unit matrix I. 
Thus, denoting the matrix products (5.3) by @, («=1, 2,...), we have 


TI* = D1 Ya Oe; (5.4) 


where the coefficients 7, are scalars, which, from the manner in which Theorem 1 
is derived, are readily seen to be expressible as polynomials in traces of matrix 
products formed from ap (P=1,2,...,) of partial degrees in each of the 
matrices less than or equal to those of II*. 


From (5.2) and (5.4), we obtain 


Die ,,. (5:5) 


Thus, © 
1D = yet 6, ys (5.6) 


By successively repeating this procedure for each of the traces of matrix products 
in terms of which the coefficients y, are expressed, it is seen that trIZ must be 
expressible as a polynomial in the expressions tr @,0),,.. We thus obtain 
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Lemma 4. The trace of a matrix product formed from the R, 3X3 matrices Ap 


(P =4, 2,..., R) may be expressed as a polynomial in expressions having the forms 
trdydgzaza, (K+L), (5.7) 
tray, 0, Or 2 (KL), (5.8) 
tr dy va; Ww (5.9) 
and 
tl ML abs (5.10) 


which have partial degrees in each of the matrices Gp less than or equal to the partial 
degrees of trII, where y and z are matrix products formed from some or all of the 
factors Gp (P=1,2,...,R; P+K,L), v,w and u are matnx products formed 
from some or all of the factors dp (P=1, 2,..., R), no two factors in yz, in vw 
or in u being the same, and, in particular cases y, z,v,w and u may be the unit 
matrix I. 

We note that the trace of any matrix product of R matrices ap is a scalar 
invariant of the matrices under all orthogonal transformations. In particular, 
the expressions (5.7), (5.8), (5.9) and (5.10) are scalar invariants. 

In the case when the R matrices @p are symmetric, we have [2] 


Lemma 5. Any polynomial scalar invariant, under either the full or the proper 
orthogonal group, of any number of symmetric matrices may be expressed as a poly- 
nomial in traces of matrix products formed from these matrices. 

It follows, of course, from Lemmas 4 and 5 that the set of invariants (5.7), 
(5.8), (5.9) and (5.10) forms a finite integrity basis for the R symmetric 3 x3 
matrices @p under either the full or the proper orthogonal group. 


6. Some relations concerning the reducibility of scalar invariants of symmetric 
3 x 3 matrices 


If J, and J, are two scalar invariants for the set of R (not necessarily sym- 


metric) 3X3 matrices ap (P=1, 2,..., R) of equal partial degrees in each of 
the matrices and J, may be expressed in the form 
Chae ee (6.1) 


where P is a polynomial in invariants of total degree less than that of J, and J,, 
we say that J, is equivalent to J, and write 


L=_,. (6.2) 

I,— I, is itself a scalar invariant of the set of matrices and is said to be reducible. 
We indicate this by 

I,—I,=0. (6.3) 

In the present paper, we shall derive a number of relations between traces 


of matrix products formed from the R matrices @p, which are of value in es- 


tablishing the equivalence between various invariants of the matrices and the 
reducibility of others. 
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It is easily seen that 


Or gy % lo be Oe gua gy Xe Cpe Oe_1 gyhe gy % Cpa, ae 
trap ay ...a% = tr ay OF A... jet = tr af ate an... afet =-+-, (6.4) 
where each of the subscripts 7,, 72, ..., 7, is one of the numbers 1, 2,..., R and 


the indices a, %),.:.,, are positive integers. We thus have 


Lemma 6. The trace of a matrix product formed from 3 <3 matrices ts unaltered 
by cyclic permutation of the factors in the product. 

Now, suppose that the matrices ap (P=1,2,.:.,R) are symmetric 3 x3 
matrices. We see that a? ... a#:a} is the transpose of a} a}... a¥. Since the 
trace of any matrix is equal to the trace of its transpose, we have 


Lemma 7. The trace of a matrix product formed from symmetric 3 <3 matrices 
is unaltered if the order of the factors in the product 1s reversed. 


A number of further relations between traces of matrix products formed 
from 3 x3 matrices can be obtained from the relations derived in § 2. Let a, b, € 
be three 3 x3 matrices and let y and z be any 3 x3 matrices (in particular cases J). 
Then multiplying (2.2) throughout on the left by y and on the right by 2, we 


obtain tryabez+trybeaz+trycabe+ 


6.5 
+trybacz-+tryacbz-+ trycbaz=0, ee 
unless y—<2— J. 
Taking a=b =c in (6.5), we obtain 
try@z=0, (6.6) 
unless y=2=1. 
Again, taking c=a in (6.5), we obtain 
tryabaz-+tryb@z4try@’bz=0, (6.7) 
unless y=2= I. 
From equation (2.5), we obtain 
tryaba’z+try@baz=0, (6.8) 
even if y=z2 =I. 
From equation (2.6), we obtain 
try@ba’z=0, (6.9) 


even if y=z2=I. 
Now, let # (++) be any 3 x3 matrix. Multiplying (2.15) throughout on the 
left by y and on the right by 2, we have 
try a’a'b*z = 0, (6.10) 
tinlecsu) 2 —J, 
Multiplying (2.15) throughout on the right by ye’, we obtain 
trvae bryce? = 0. (6.11) 


Again multiplying (2.20) throughout on the left by y and on the right by z, 


we obtain try a2b?2c2z =0, (6.12) 


unless y=Z=I. 
22* 
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If y =z =I, on taking the trace of both sides of (2.20), we have, with Lemma 6, 
trab?c?+tre’?b'a=0. (6.13) 

If a, b and © are symmetric matrices, we obtain from (6.13) and Lemma 7 
trebc?= 0. (6.14) 
Finally, multiplying equations (2.28) and (2.29) on the left by 2, we obtain 


trzab*ey =0 (6.15) 


as trza*b?ay=0, (6.16) 


unless y=I or 2=1. 


7. The reduction of the invariants for R, 3 X 3 matrices 


In this section, we shall use the results of § 6 to investigate the reducibility 
of the invariants (5.7), (5.8), (5.9) and (5.10). 

We first consider invariants of the form (5.7), viz. traya@,gazaz. As a 
special case, we may have M=L, but if M=K, the invariant is tr a; aj ax, 
which by Lemma 6 is trax az and is therefore reducible. 

Next we consider invariants of the form (5.8). These have total degree =< R +3. 
We consider the three cases M=K, M=L, M+K,L separately. 


Case M=Kk. We have, from (6.8), 
tra, y a2 at e= — traryar azz, 


which, from (6.10), is reducible unless y=. If y=TI, (5.8) takes the form 
tr dy a2, ai, z. Then, either =I so that, using Lemma 6, (5.8) may be written 
tr a?.a?., which is reducible; or  =a@p and (5.8) becomes tr dx a7, a @p, which 
may be written as tr @p ax aj a% and then has the form (5.7) discussed above; 
or, finally, 2=apw (w=-I). In this latter case, using Lemma 6, (5.8) may be 
expressed in the form trwa, aj aj ap. Replacing z,a,b and y by w, ax, a, 
and ap respectively in (6.15), we see that tr wa, aj aj ap (w+1) is reducible. 
Case M=L. We have, from (6.8), 


9 6 
ttapy a a2 = — 41a) ua, are. 


which is reducible unless g =F. Since, from Lemma6, tr a7 y a, az—tr y aaj; a? , 
we see, as before, that tra, y aja; 2=0 unless ¢=J and y=ap. 


Case M=-K, L. If ay is not a factor of y or 2, then (5.8) must of necessity 
have total degree <R-+2 and must be expressible in the form tr ya?.a% where 
y is a matrix product formed from some or all of the factors Op (P= 2, Sees 
P=K,L), no factor being repeated. If ay, is a factor of either y or 2, it is easily 
seen that (5.8) is reducible, by using one or more of the results expressed by 
(6.7), (6.10), (6.12) and Lemma 6, depending on the position of the factor ay 
in y or 2. 

We now consider invariants of the form (5.9). These have total degree 
<k-+3. Ifa, is not a factor of either w or w, (5.9) may be expressed in the form 
tr vak, where v is a matrix product formed from some or all of the factors Gp 
(P =1, 2,..., R), no factor being repeated. If ay, is a factor of v or w, then either 
M=K, or M+K. We shall treat these two cases separately. 
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Case M=K. If ax is a factor of v, the invariant (5.9) takes either the form 
tr ak wax w (Us) or trax vax wax w («©+1,u+I). From (6.9), we see that 
the first of these forms is reducible. We have, from (6.7). 

tr dg Ha, Ua w = — tra, xuaz,w — trxeauarw, 
and, from (6.9), each of the expressions on the right-hand side is reducible. If 
ax is a factor of w, the invariant (5.9) takes the form tra,va;raxu (w+1). 
This is seen to be reducible in a manner similar to that employed in discussing 
the reducibility of trayra,uazw. 


Case M+ K. If ay, is a factor of v, then (5.9) takes either the form tr a},x7az-w 
or tr dy Uayxazw (u+J). In the first case the form is reducible, from (6.10), 
unless # =I or w=I, in which case it may be written tr a3,a%-w or tr aj,xaxz,, 
which from Lemma 6 is tr wa;a},. In the second case, we have, from (6.7), 


trayUa,xayw = — trai,uxazw —truad, «caw 


and from (6.10) we see that both terms on the right-hand side are reducible 
provided that «== and w+I. If «=I, the first term on the right-hand side 
takes the form —traj,uazw (u-+J) and the second term takes the form 
—truajazw (u-+1). Now, tr ai,wazw is reducible unless w =I in which case 
it takes the form traj,waz, which from Lemma 6 is tr waz-ai,. Also, from 
Lemma 6, tr uaj,az-w = tr wuai,a;,. If ay is a factor of w, then (5.9) takes 
the form tr a,,va;«a,,u, and, from (6.7), we have 


thy, © O22 ay = Str01; 0 a ew tr at a2 ai, u. 


From (6.10) the first of these terms is reducible unless v =F or eu =F, in which 
case it takes either the form — tr a},a%-au or — tr a},vai., which from Lemma 6 
are — tr eua},az. and —tr vazaj,. From (6.10) tr vazaa},u is reducible unless 
a =I or both v =I and u=I, in which cases it takes either the form tr vaz-a5,u 
or traz«az,, which, from Lemma 6, are equal to truvazaj, and tray, ax. 

We thus see that invariants of the form (5.9) are either reducible or equi- 
valent to linear combinations of invariants of the forms tr va% and tr yazaz, 
where w is a matrix product formed from some or all of the factors ap (P =1, 
2,..., R) and y is a matrix product formed from some or all of the factors ap 
(ee Ae 2ee ew = KE) nosfactors. noon in ’y: being repeated. 

Finally, we consider invariants of the form (5.10). These have total degree 
<R+41. If ay is not a factor of u, then (5.10) has the form tru, where wu is 
a matrix product formed from some or all of the factors ap (P=1, 2,..., R), 
no factor being repeated. If ay is a factor u then (5.10) has the form tr ai, 
or tray xa,y (a+). In the latter case we see, from (6.7) and Lemma 6, that 


a 2 2 
tray Layy = — tray, xy —traayy = — trayay — tryxa),. 


Combining the results obtained in this section with Lemma 4, we obtain 


Theorem 2. The trace of any matrix product formed from the R, 3 x3 matrices 
ap (P=1, 2,..., R) may be expressed as a polynomial in traces of matrix products 
of lower or equal partial degrees in each of the matrices and of lower or equal 
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extensions, having the forms 


try; ay “\UReEL)s (7.1) 
trv az, (722) 
tru, (7.3) 
tithy G02 a; 9 (Kae L, Mae 2) (7.4) 
and tt de; (7.5) 


where y is either the unit matrix or a matrix product formed from some or all of 
the factors @p (P=1,2,..., RK; P=-K,L), no two factors in y being the same; 
w is either the unit matrix or a matrix product formed from some or all of the factors 
ap (P=1, 2,..., R), such that ax 1s not the first or last factor in v and no two 
factors in v are the same; u is a matrix product formed from some or all of the 
factors ap (P =1, 2,..., R), no two factors in which are the same. 


We shall now assume that each of the matrices @p (P =1, 2,..., R) is sym- 
metric. Then, from Lemmas 6 and 7, we have 


Q 
try @; a? a? = tra} at a, ay = tt ay a, a a;. 
Using the relation (6.8), we have 
tr dy, @, a? a? = 1 tray, (a, a% a} + aj az a;) =0. 


We thus obtain 
Theorem 3. An integrity basis for the R symmetric 3X3 matrices Ap (P=1, 
2,..., R), under the full or the proper orthogonal group, is formed by the invariants, 
of total degree <R+2, 


te Re 4 Ahoe ea (7.6) 

trwax, (77) 

gat (7.8) 
and 

tra, (7.9) 
where # is either the unit matrix or a matrix product formed from some or all of 
the factors @p (P =1,2,...,R; P=+-K,L), no two factors in 2 being the same; 
w is either the unit matrix or a matrix product formed from some or all of the factors 
@p (P=1, 2,..., R), no two factors in w being the same and if ax is one of the 
factors of w, it is not the first factor or the last factor; x is a matrix product formed 
from some or all of the factors ap (P=1, 2,..., R), no two factors in x being the 
same. 


8. Integrity bases for five or fewer symmetric matrices 

From Theorem 3 we may write down directly finite integrity bases for five 
or fewer symmetric matrices under the orthogonal group. 

It is evident that if we write down all the invariants given by equations 
(7.6), (7.7), (7.8) and (7.9) for R symmetric matrices ap (P =1, 2,..., R), these 
consist of the invariants for each set of R—1 matrices which can be chosen from 
ap (P=1, 2,..., R), together with the invariants given by (7.6), (7.7) and (7.8) 
which involve all & of the matrices. Thus an integrity basis for five symmetric 
3 x3 matrices a, b, c,d, e may be formed by the integrity bases for each set of 


Matrices for Isotropic Continua 334 


four matrices which can be selected from a, b, ¢, d, e, together with the invariants 
given by equations (7.6), (7.7) and (7.8) which involve all five of the matrices. 

Single matrix a. From (7.8), (7.7) and (7.9) respectively, we obtain only the 
invariants 

tra, tra® and tra’, (8.1) 
which therefore form an integrity basis for the single matrix a. We note that no 
invariants of a single matrix are given by (7.6). 

Two matrices a and b. Writing down only the invariants given by (7.6), 
(7.7) and (7.8) by taking a,=a and a,=b, which involve both a and b, we 
obtain 

tre b29 t.b00— Acad... trboa. (8.2) 
Thus, an integrity basis for the two symmetric 3 x3 matrices a and b under 
the orthogonal group is formed by the invariants (8.1) and (8.2), together with 
the invariants formed from these by interchange of @ and b. 


Three matrices a, b,c. The invariants given by Theorem 3 which involve all 
three matrices a, b, € are 


trab*e?, trabe?, tracbe? and trabe, (8.3) 
together with the invariants obtained from these by permutations of a, b, ce. 
Now, employing Lemmas 6 and 7, we have 

rach c*—trebca—itrac be. (8.4) 
We thus have, with (6.8), 
tracbe?=Ftra(ebec?+ec’?be) =0. 
Thus, an integrity basis for the three symmetric 3 x3 matrices a, b, e, under 


the orthogonal group is given by the integrity bases for the three pairs of matrices 
which can be selected from a, b, c, the invariants 


trab’e?, trabe? and trabe, (8.5) 


and the invariants which can be formed from these by permutations of a, b, ce. 


Four matrices a,b, c,d. In a similar manner to that employed above, we 
see that an integrity basis for the four symmetric 3 x3 matrices a, b, ec, d under 
the orthogonal group is given by the integrity bases for the four sets of three 
matrices which can be selected from a, b, ¢, d, the invariants 


trbacda?, trbcada’, trabe’?d’, trabed’, trabed, (8.6) 


and the invariants obtained from these by permutations of a, b, c,d. 

Five matrices a, b, c,d, e. Again, we see from Theorem 3, that an integrity 
basis for the five symmetric 3 <3 matrices a, b, c,d,e under the orthogonal 
group is given by the integrity bases for the five sets of four matrices which can 
be selected from a, b, e, d, e, the invariants 

trabcd’e?, trabcede’, trabecde’, 


8.7 
traebcde?, trabcde’, trabcde 7) 


and the invariants obtained from these by permutations of a, b, c,d, e. 
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The integrity bases given in this section are redundant, since, except in the 
case of the integrity basis for a single matrix, certain of their elements can be 
expressed as polynomials in the others. Less highly redundant integrity bases 
for five or fewer matrices will be derived in a later paper. However, the integrity 
bases given here have the possible merit of symmetry with respect to interchange 
of the matrices. 


9. Application of Peano’s theorem to the determination of an integrity basis 
for R symmetric 3 X 3 matrices 


Let us define the elements of the R symmetric 3 x3 matrices ap (P=1, 


De scan ls) | 
) by ap =||a)|. (0.1) 


Let us suppose that J,, J,,..., I, is a set of polynomial invariants which forms 
an integrity basis for the five matrices @,, @,, @3, @,, @; under the full or proper 
orthogonal group. Then, it follows from PEANo’s theorem (see, for example, [3)) 
that an integrity basis for the R matrices ap (P =1, 2,..., R) may be obtained 
by polarization of J,, J;, ..., J, and of the invariant J of the six matrices a), @,, 
a3, @,, A;, 4, defined by 


Cheah ey Cae Uhh eel 
Aji, 422, 433, 423, 431, 412 
(2) (2) (2) (2) (2) (2) 
A141, 429, 433, 423, 431, Ae 


(6) (6) (6) (6) (6) (6) 
A141, 429, 433, 423, 431, Ne 


Thus, an integrity basis for the R matrices is given by 


7 Pa x 
atk) alka). qlKx) gd 
441 tee XIX dai) da\=2) oat”) 
Pet nh tege 22° XOX, 
. (9.3) 
AED gle) gan Cars = 
EEE CNY, IE SUD EI (L)) (Ly) Ly)? 
Oa; Oa; ... Oat m 
RWINCTO Gyan 2, colon ar Sannin ieee K,, £1, Le, ..., L, are integers, not necessarily 
all different, chosen from 1, 2,..., R and y is any positive integer. 


We note that the invariants generated in (9.3) from J, and J have the same 
total degrees as J, and J respectively. Now, taking R=5 in Theorem 3, we 
obtain an integrity basis for five symmetric 3 x3 matrices, the members of which 
have total degrees <7. This is given in § 8. Also, from (9.2), J has total degree 6. 
Thus, if we take as J,, I,, ..., ,, the members of the integrity basis for the five 
matrices @,, @,,...,@; given in § 8, equations (9.3) generate invariants of total 
degree <7. 

Since any invariant of the R symmetric matrices ap of total degree y is expres- 
sible as a polynomial in traces of matrix products formed from ap of total degree 
Sy, an invariant of the R matrices ap of total degree <7 must be expressible 
as a polynomial in traces of matrix products which either 

(7) involve five or fewer matrices, 


or (2) have total degree 7, involve six matrices and are of partial degree 1 in 
each of five matrices and of partial degree 2 in the sixth matrix, 
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or (11) have total degree 7, involve seven matrices and are of partial degree 1 
in each of them, 


(iv) have total degree 6, involve six matrices and are of partial degree 1 in 
each of them. 
With Theorem 3, we obtain 


Theorem 4. An integrity basis for R symmetric 3X3 matrices dp (P=1, 
2,...,R), under the full or the proper orthogonal group, is formed by the integrity 
bases for each set of five matrices which can be selected from ap, together with the 
invariants 


tr Ax, Ax, Ax, Ax, Ax, Ak,, 
2 
tr Ax, Ax, Ax, Ax, Ux, Uk, 
and 
tr ax, ax, ax, ax, ax, ax, ax, 2 


where K,, Ky,..., K, are integers, all different, selected from 1, 2,...,R. 


10. Application of Peano’s theorem to the contraction of isotropic matrix 


polynomials 
Let P be a symmetric matrix polynomial in the R symmetric 3 x3 matrices 
ap (P =1, 2,..., R) in which the coefficients of the matrix products are either 


numerical constants or polynomial invariants under the orthogonal group of the 
matrices @p. P is then, of course, an isotropic matrix polynomial. Let y be an 
arbitrary symmetric 3 x3 matrix. Let J,,/,,...,J, be a set of polynomial 
invariants which form an integrity basis for the R matrices @p under the ortho- 
gonal group. There exists an integrity basis for the R matrices ap and the matrix 


y which consists of J,,/,,..., 1, together with additional invariants each of 
which involves y. Let J,, Jz, ..., J, be those additional invariants in the integrity 
basis for @p (P =1, 2,..., R) any y which are linear in the elements of y. 


Since tr yP is a scalar invariant under the orthogonal group (full or proper) 
which is linear in the elements of y, it can be expressed in the form 


” 
try p = 2: Pa Jas (10.1) 


where gy, («=1,2,...,7) are polynomials in J,,/,,...,/,. If we define the 


elements of P by 
=i, 


Me 


then, from (10.1), we have 


Pj=5 ayy YP) tae (try P)| = +>) Peles ae (10.2) 


Now, suppose we choose the integrity basis for y and the k matrices ap in 
accordance with Theorem 4. Then, each of the invariants J, is either 


(i) linear in y and involves four or fewer of the matrices ap; 
or (ii) has the form tr yax,4x,4x,@,,@,%,, In which case 


|. Be. Oe 
OVi5 OV;i5 


= A, Ax, Ax, AK, Ak, + Ax, Ux, Ux, UK, Uk, 
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Ne 2 
or (ii) has one of the forms tr yd, 4x,Ax,4x,4K,; 


- 2 2 
tran, Yan, An, An,AK,, tr Ag, Ag, Y Ax, 4k, Ak, ; 


2 2 
tr dg, x, An, YAn,dx, OY tran, Ag, Ag, AK Y U,, 
in which cases, 


OJ 


aos pose. 
OVi; 


2 2 
hein Ax, Ax, Ax, AK, UK, + Uk, UK, UK, UK, UK,» 
qt 


2 2 
Ay Ax, AK, Ak, aK, 7 i ax, Ak,4K, AK, Uk,» 
2 2 
Ax, AK, AK, AK, Ax, + Ag, An, AK, AK,AK,; 


2 2 
Ax, Ak, A, Ak, Ag, + Ak, An, Ag, AK, Ax, 
or 


2 2 
Ok Ax, Ax, Ax, Ax, + Ak, ax, Ax, Ax, W,, 


respectively; or 

(iv) has the form tr yd, dx, Ax,Ax,4%,A,%,, in which case 
OJu Ou 
| Cy en; 
In cases (17) or (iit), K,, Ky, Kz, Ky, K; are five numbers, all different, of the 
set 1, 2,..., 8; in case (77), .K,,K,, ..-, Kg are-six numbets, all different, of ime 
set 1, 2,..., R. We note that in case (771), when we take into account the fact 
that all selections of K,, K,...,K; (all different) from 1, 2,..., R are possible, only 
the first three of the forms given for ||@J,/0y,;;+ 0J,/@y,,|| are essentially different. 
In case (7), OJ,/0¥,;;+ 0J,/0y;; 1s a symmetric matrix polynomial in four of 
the matrices Gp, say, Ax,,4x,,@x,,Ax,. From § 4, we see that such a matrix 
polynomial may be expressed as a matrix polynomial in which the terms have 

the forms (4.7) and (4.10) with a, b, e,d replaced by ax,, @x,, Ax,, Ax,- 


=. ar, ax, ax, 4k,4x, 4K, + ak, Ax, 4K, 4K, Ok, ax,- 


Combining the results for cases (i), (ii), (ii) and (iv) we obtain 

Theorem 5. Any symmetric isotropic matrix polynomial in R symmetric 3X3 
matrices Ap (P=1, 2,..., R) may be expressed as an isotropic matrix polynomial 
an which the terms have the forms 


2. 
| Fo Las 


2 2 2 2 Syeory 
Gx, 4x, + Ax,4,,, Ax, Ax, + Ax, ak, ax, Ax, + Ax, Ak} 


Ais 2 2 
ax, Ax, Ak, ax, 4, AK,; Ax, Ax, Ox, = ax, 4x, &k,, 


2 te 2 ee AP 
ax, Ax, Ax, T Ax, Wk, Ax,, ak, Uk, Ax, + 4x, Ax, Ak, ; 


2 an 2 2 2 : 
Or OR OK TK, © OK, OOK, Ox, Ax, Ak, Ux, Ux, + Uk, Ux, Ax, U, ; 


Ox, 4x, 4K, AK, T AK, Ax, Ax, Ax, , 


2 a 2 2 2 

ax, Ak, Wx, AK, T An, dg, Ax, Ag, x, Ok, 4K, AK, + Ax, Ax, Ok, Ax, , 
2 9 Dy Ap 2 79 2 2 

ak, Uk, Ux, UK, + AK, dg, Ak, Ak,, Ax, Ak, Ak, Ag, + Ag, dk, Ak, Ax, , 


Ax, Ax, Ax, Ax, Ak, + Uk, Ag, Ag, A, Ax, a K, 4K, Ax, Aq, A, + Ok, Ag, Ag, Ax, Ax,, 
AK, AK, A, AK AK, + Ak, Ag, Aq, Ag, Ag, Ax, Ax, Ax, Ak, Ag, + Ag, A, Ag, Ax, Ax, } 
Ax, Ax, Ux, UK, AK, TAK, AK, AK, AK, Ax,, Ax, Ag, Ag, Ag, Ak, + Ok, Ax, Ax, Ax, Ax, 
AK, AK, UK UK Ag, + Ak, Wk, Ax, Ax,Ax,, Ax, Ax,@k, Ax, Ax, + Ax, Ag, A, Ax, Ax,, 


ak, ax, ax, ax, ax, ax, te ax, ax, ax, ak, ar, ax, 5 
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for all selections of K,, Kz, ..., Kg (all different) chosen from 1, 2,..., R, and the 
coefficients are polynomial valle imvariants under the orthogonal group of ap 
eA 2,22, A). 

We may also express the symmetric isotropic matrix polynomial P as a 
symmetric isotropic matrix polynomial, in which each of the matrix terms involves 
no more than six matrices, from a knowledge of the elements of an integrity 
basis for seven symmetric matrices which are linear in one of them. Let 
I,(dx,,Ax,,-..,4x,, y) be the elements of an integrity basis for the seven sym- 
metric matrices dy,,@x,,...,@,, and y, which are linear in the elements of y. 
Then, tr yp may be expressed in the form 


YD = ee Viel Ae ce Og .Y); (10.3) 
a 
where the coefficients y, are polynomials in the elements of an integrity basis 
for the R matrices a,,d,,...,@p, >) denotes summation over the invariants 
a 

T(x, x,,-.-,x,, y) and >’ denotes summation over all possible selections of 
the six matrices d,,, Mx,,...,@x, from @,,Q,,...,@z. From (10.3), we obtain 

1 ol alle,» 
oy >! va( at + Sak (10.4) 

a \ OVij Vii 


The form given by (10.4) for P(=||P,,||) will depend, in general, on the integrity 
basis chosen for J,. 
11. Appendix 
In §2 we remarked that the relation (2.2) which is satisfied by any three 
3 x3 matrices may be derived from the Hamilton-Cayley theorem. If the matrices 
a,b and € in (2.2) are defined by 


a=|la,,||, 6 =|/5,;|| and _e=[e;;||, (11.1) 


then it can be easily shown that the relation (2.2) is equivalent to the relation 


Opes or , Onis 1115 
cs gan ree w, oe 
V271? teJ2? 1273? 1274 a,j, Di, Cj. O, (11.2) 
wpe wees ae On, 
Ds 0; tsJ2? Ore o;, js 


by expanding the determinant and interpreting each of the terms obtained in 
matrix notation. 

By a similar procedure, it can be shown that the Hamilton-Cayley theorem 
for a 3X3 matrix a is equivalent to the relation 
hija? 


fi? tifa? 


t2fi? 42 Je? to Ja? 


tifa Qin jn 4;,;,— 0. (11.3) 


mS 


ig fi? 13 ]2? 13]? 


0; 6 
0; 0; 
0; 0; 
0; 0; 


mS 


1471? asf? 1572? ts4 
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Both (11.2) and (11.3) follow immediately from the evident validity of the relation 


On Opps One Oy, 

0;, ji? 0;, 2? 0;, qa? 0;, V4 a8 6) x (1 | 4) 
Or 3 On Ons Ons 

a) 0) ) a) 


asf? 1yJ2? Us]? tg)a 


The relation (11.2) can be derived from (11.3) by a polarization procedure. 


Denoting the expression on the left-hand side of (11.2) by F;,,, and that on the 
left-hand side of (11.3) by G;.,, it is easily seen that 


as? 


0Gi,; 
——- (11.5) 
Ap] Onn 


6F, = Dyi Cmn a, 


1s Ja 
and hence that (11.2) follows from (11.3). 
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Einstein s Equations and Classical Hydrodynamics 


JUDITH BLANKFIELD & G. C. MCVITTIE 


1. Introduction 


As is well-known, the theory of general relativity involves field equations, 
known as Einstein’s equations, which equate the energy-tensor that describes 
the physical properties of matter to the geometrical Einstein tensor. A constant 
of proportionality is involved in these equations which has to be evaluated by 
an approximation procedure that reduces Einstein’s equations to those of New- 
tonian gravitational theory. Traditionally this has always been done by working 
out the Einstein tensor for a statical, orthogonal and isotropic metric whose 
metrical tensor departs slightly from that of special relativity (MCVITTIE 1956a). 
It is then shown that Poisson’s equation can be obtained by choosing the constant 
of proportionality to be itself proportional to the constant of gravitation. 

The question arises: What equations of classical theory would be obtained 
if one starts with metrics whose metrical tensors have more complicated forms 
than the simple type traditionally assumed? A partial answer to this question 
has been provided by one of us (McViTTIE 1956b). Assuming an orthogonal 
metrical tensor of a somewhat more general form than that commonly used and 
an energy tensor appropriate to a perfect fluid, the equations of motion and of 
continuity of classical hydrodynamics were arrived at. Moreover the procedure 
led to solutions of these classical equations in terms of indeterminate functions, 
one of which was shown to be the gravitational potential for the self-attraction 
of the fluid. Incidentally, it may be remarked that it is possible to solve the equa- 
tions of classical hydrodynamics in a very general manner in terms of indeter- 
minate functions directly and without the use of general relativity (FINZI 1934; 
WHITHAM 1954) though the physical interpretation of the resulting functions is 
not entirely clear. 

In McVittTie’s treatment of the reduction of Einstein’s equations to their 
classical counterparts two points emerged: firstly, it was evident that two 
approximation parameters were involved, one of which was essentially the con- 
stant of gravitation, while the other contained the reciprocal of the square of 
the velocity of light. Secondly, the solutions of the equations of hydrodynamics 
so obtained had the following property: if @ is the density of the fluid, and U 
the fluid-velocity, then the momentum-vorticity, Vx (@U), was necessarily zero. 
It was conjectured that this restriction arose because of the initial use of an 
orthogonal metrical tensor. 

The aims of the present investigation are, firstly, to put the approximation 
procedure on a sound basis by the introduction of dimensionless variables in 
the general relativity equations. It is shown that there are two approximation 
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parameters, denoted by 7 and « (Eqn. (2.06) below) the first of which involves 
the constant of gravitation, the second the ratio of a standard velocity to the 
velocity of light. Secondly, ways of removing the restriction to fluid motions 
implied by the vanishing of the momentum-vorticity are examined. It is shown 
that starting with a non-orthogonal metric certainly removes the vanishing of 
the momentum-vorticity. But, by a suitable application of infinitesimal co- 
ordinate-transformations, it can be proved that certain kinds of orthogonal 
metrical tensors (Eqn. (4.38)) can also lead to a non-vanishing momentum- 
vorticity. The nature of these metrical tensors may be seen by comparing (4.37) 
and (4.38) with (3.04). In the latter case the terms which do not have é? as a 
factor involve one function only, the gravitational potential y. But in the former 
case, the corresponding terms are all different. Thus a zero momentum-vorticity 
is not associated with the orthogonality of the metrical tensor but with the 
detailed form of the components of this tensor. 

In Section 5 we return to the nature of the approximation procedure. EIN- 
STEIN & INFELD (1949) have proposed a method of successive approximations in 
dealing with Einstein’s equations which employs a single arbitrary small para- 
meter A. The relationship of this parameter to 7 and ¢ is examined and it is shown 
that it corresponds to ¢ and cannot be identified with 7. Thus the degree of 
arbitrariness of 4 appears to be limited. 


2. Dimensionless variables 


In order that the magnitudes of variables and functions with physical di- 
mensions may be compared, dimensionless quantities will be introduced. They 
will be related to ordinary physical variables and functions expressed in cgs 
units in order to facilitate any future physical applications. 


Consider the following metric in the cgs system 


3 : 3 
dst = (1 + aTd (Qt — ZY) Vdd — 2 D2 V 2%, (2.01) 


7 
t=1 M4 


where c is the velocity of light, x =82G/c? and G is the gravitational constant. 
The physical dimensions of the above quantities are, 


Ss time , 


as time, 


wy (4, "= 4, 1, 2,3) ay 


Thus the V,., have the same dimensions as gravitational potentials. If an arbi- 
trary fixed length, /, and an arbitrary fixed time-interval, ¢, are chosen, dimension- 
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less coordinates, §“, may be defined as follows: 


Cee ty Ad 2h), 
(2.02) 


The equations (2.02) define a coordinate transformation under which the metric 
(2.01) becomes 


3 
dst =P 42M) (dy 2! Ye Vyas ast ae HHH) ER (2.03) 


4=1 


Dividing (2.03) by # and defining o =s/t yields 


2 3 : 
do® = (1 + %Vj,) (a6)? — 24S) nV satiaee— WO? SY (4 +2) (AE)®. (2.04) 
11 t=1 


The expressions x V,,, are dimensionless, but x and V,, separately are not. Let 


V be an arbitrary fixed gravitational potential, and let xV,,=n7y,,, where 
4 =xV and y,,=V,,/V. Further, let v=l/t and ¢=v/c, then (2.04) becomes 


uy 
do” = (1 + 7 Yaa) (45%) yb 26 Sina db ad — et > (1 1+ yis) (@E)?, (2.05) 


where the dimensionless variables are related to those in (2.01) by 


—— (AeA 2503) 5 
selene 
pase 
pied 
aren 
8aGV 
eka = (2.06) 
Pia ae 
pee 
=i 
ee 
aa Ce 


The metrical tensor g,,, for the metric (2.05) is 


Gsa = 14+ Yea, 

ao ae i (+ =1, 2, 3), 
84 V4 | (2.07) 
ipo (1 7; (2 =1, 2, 3), 


The energy tensor and Einstein’s equations are known for the metric (2.01) in 
cgs units (McVITTIE 1956c). Let %,,, R,,, and T“” be the metrical, Ricci, and 
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energy tensors corresponding to the metric (2.01). Then 


ds? =f yA ax, (2.08) 
EA eine ey Olah ae 

ae aa! ax” -35 La Se ge ti vf — xt ) (2.09) 
aa Bi BD \ Wee Ola Spee 

oy | P EEN EEN ese DENG 
L (2 4 OS HES § (oy ( ) 
= 8n GT = Rate, (2201) 


where the Christoffel symbols in (2.09) are defined in terms of (2.08), and in 
(2.10) @ and # are the density and pressure in cgs units. In order to find expres- 
sions corresponding to (2.09) to (2.11) for the metric (2.05), the transformation 
(2.02) will be used as an intermediary step. This transformation will be denoted 
by a prime. Let g os : Re , and T’“” be the transforms of g Ci Ruy, and 7 Samder 
the transformation (2.02). Then, since g,, is given by (2.01), g,, by (2.03), and 
Su» by (2.05), it is easily seen that 


Pahes 
guy ou p Suv 


2 
git = pagte (2.12) 
The Ricci tensor R,, for the metric (2.05) is defined by analogy with (2.09) as 
follows: 


init 6 See eee @lny=g 
Ruy = gen age — ge wd thush bot — Wah ; (2.13) 


where now the Christoffel symbols are defined in terms of the metric (2.05). 
Combining (2.13) and (2.12) with (2.09), gives 
Ieee 


uy? 


Re’ — PR (PEN (2.14) 


The energy tensor 7” for the dimensionless metric (2.05) should also be dimen- 
sionless. By analogy with (2.10), 7”” is defined as follows 


T= (e+e) (Go\(F_) — et ep, (2.15) 
where 
@ oe 
ear 2 (2.16) 
p=B> 


By comparing (2.15) with (2.10), 7”” can be found in terms of T’“”. Thus 
ee 2 aé a& uy 
ce 


Flee RMGEMeE)-erk] a 


c2 


_e ee 
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Einstein’s equations can now be derived for the dimensionless metric (2.05). 


Since (2.11) is a tensor equation, it remains valid after a coordinate transforma- 
tion. Thus 


— 8nGT = RAZR, (2.18) 


Substituting the dimensionless tensors given by (2.12), (2.14), and (2.17) into 
(2.18) gives 


OG Vat 1 py 1 A ay 
or — Pe if at 2° #e Be 
8uaGV » 1 ou 
3 Th? = REY — — gt? R, (2.19) 
or 


poeslld By @ Je 27 See a 2 
2 2 § } 


The Newtonian approximation to Einstein’s equations (2.19) will be examined 
in the course of the following sections. That approximation will be defined as 
the result of first cancelling factors —y/e? on both sides of Einstein’s equations 
and then setting «* equal to zero. The second step implies that the standard 
velocity v is such that its square is negligibly small compared with the square 
of the velocity of light. Let 
aks dE 4 


L 
Wh = (2.20) 
then, neglecting terms of order 7, (2.05) becomes 
3 
1 = (ut)? — ¢2 >) (u’)?, (2.21) 


i=1 

so that in the Newtonian approximation “4 becomes equal to unity. The u' 
(¢ =1, 2,3) become the dimensionless Newtonian velocity components U; where 
the vU, are the ordinary Newtonian velocity components expressed in cgs units. 
The density @ and the pressure # become the dimensionless Newtonian density 
and pressure where oV//? and fV/é are the ordinary Newtonian density and 
pressure. The coordinate &4 becomes T where ¢T is the Newtonian absolute time 
in seconds, and the &* (¢ =1, 2, 3) become X; where the /X; are the usual New- 
tonian space coordinates. The y,, become the same functions of T and the X; 
as they were of the &“ (u =4, 1, 2, 3). 

In connection with the above symbols, certain conventions will be employed. 
Greek indices will range over the values 4, 1, 2,3 and Latin indices will take 
on the values 1, 2, 3 only. The indices /, m, m will always denote a cyclic permuta- 
tion of 1, 2,3. A comma will be used to denote partial differentiation with respect 
to the coordinate indicated by the subscript after the comma. For example, 


However, the same symbol y33 44 will be used to denote the derivative with 
respect to the Newtonian coordinates when no ambiguity can arise. In such an 


instance 
Yea 1a = 
Ba 14 = ox ors 
Arch. Rational Mech, Anal., Vol. 2 23 
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The symbol 7? will be similarly used in both contexts. Thus 


a oY e oe 
y > (eS)? r a aN Fe fe 
OF y= @2 o2 ~ o2 


(@X,)? © (OX, 1 (OX,)*’ 
whichever is applicable. 


3. Approximate forms of Einstein’s equations. 
Non-orthogonal metrics. Hydrodynamics 
The elements g,,, of the metrical tensor have already been expanded in powers 
of 7 in equations (2.07), with 7? and higher powers of 7 being neglected. Further, 
the y,, of (2.07) may be expanded in powers of ¢. The simplest case is 


Ug Ns, 
Yu—yY, 
Yai =O, 
or 3 
do? = (1 — ny) (4&4)? — OA +7 y) (48). (3.01) 


Again neglecting powers of 7 greater than one, Einstein’s equations as given by 
DiNnGLE’s formulae (McCVITTIE 1956d) are 


Un api Uh oe 
re al =e Ve, 


nH 74l 1) 
e2 if 62 W al, 
.02 
ie rime (3.02) 
€ 
Ui Hi ul 
e2 TE: = 2 Yaa 


where 7“” is given by equation (2.15). Following the procedure outlined in 
Section 2, the Newtonian approximation to equations (3.02) is 


Gate are 
oU,=Yar 
OU ee (3.03) 


o UP ep =e 
Clearly, in the above solution to the equations of hydrodynamics, at most one 
of the velocity components can be non-zero. Thus the very simple metric (3.01) 
leads to a very restricted solution. 
Since it is more convenient to work with orthogonal metrics, the next step 
in expanding the y,,, in powers of ¢ could be 


Yaa = — (y+ 2e7y,), 
Vit =O 2e Ue 
Val = 0, 
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or 3 
dot = [1 — nlp + 2ebya)] (264)? — 0° Xi [1 + aly + 26? y%)] (48!) (3.04) 


McVi1TIE (1956b) has worked out the consequences of (3.04) but with y,=0. 
The inclusion of the quantity y, is made here for reasons of symmetry, and 
represents no essential change from McViTTIE’s treatment. The results alone 
are repeated here, and the reader is referred to McVitTTIE’s work for the details. 
Einstein’s equations for the metric (3.04) are 


= ae Pe s rey +- a Pr sp == Yi.nn)| ; 


€ ,m,n 
4 i —— fs Lv, a0 + (Yn = Wn), a1| ) 
(3.05) 
a a Ti" a Ln Wa), 1m ) 
se os ee ana . (Yon Wa),n n + (Wy ce 1A) ea ae Waa sti (Yn = Wn), a4] , 
with the Newtonian approximation 
aaa dat 2 
C= , 
Eh gee (3.06) 


Q UU, a (Pn a Wa),1m> 
0 U; str p = (Yn a Va), nn AP (p, mae A a Waa: 


By analogy with the definition of the vorticity of a fluid, the vector whose 
components are 


Q, ars (0 Wi, a ( Us (3.07) 


will be called the momentum-vorticity. The motion of the fluid described by 
(3.06) is subject to the restriction 
2,=0, (3.08) 


so that its momentum-vorticity is zero. Thus, again, an orthogonal metric has 
led to a restricted motion. 


In order to obtain a more general solution than those obtained above, a 
non-orthogonal metric will be used. In particular, let 
Yaa = — (y+ 287 y), 
m= Y + 2e yp, (3.09) 
Var= éC)- 
Since DINGLE’s formulae do not apply in this case, the energy tensor will be 


calculated from (3.09) according to (2.19). For the sake of convenience, the 
calculations will be carried out in terms of the functions y,,, and the identification 


Doge 


344 JupiTH BLANKFIELD & G. C. McVITTIE: 


(3.09) will be made only at the end. From the metrical tensor 


Gsaa = 14744; 
&= — (1+), 


(3.10) 
S41 = —EYVat, 
Sim ae o 
the Christoffel symbols of the second kind 
{5} = 38°" (Bap. + Box, 8 Sepals (3.11) 


must be computed. Since all the derivatives of the g,, are of order n, it is suf- 


ficient, in (3.11), to take 
git 


’ 


Baa (3-12) 
BO Me tai). 
Thus the Christoffel symbols are found to be as follows 


{aa} = S V4a4,4> 


{a} = - 44,1 


{ii} = a Y11,4 —NEYVaLI 
{int = — (Yai,m + Yami) » 

{a4} See [Vana Tr =F Y44,l> 

{e} =D (3.13) 
{ain} a = (Yat,m — Yam,t) > 

{i} fal SVD 


{i} a Lym: 


{i * ia a Vil,m> 


Since 
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then ain v= 4 

ny — 1 

ae a aul Pa Vaav- (3.14) 
In (2.13), the terms-{%,} {7} and {3} fale are, in view of (3.13) and (3.14), 


of order 7? and are therefore neglected here. Using (3.13) and (3.14) in (2.13), 
the quantities R,,, and R are found to be 


3 
2) 1 
Ryy= oe (vs, cA ora) ae $e Va8,i) ’ 
t=1 
1 
Ry, ay a [nn = V apad) ca eas (Varo tn oH Vam,lm ae Val,nn nom Yai,mn)| ) 
GAS) 
Se 4 [(Yaa a5 Vara) lm =f € (Yar, ma ele Vam,14) | , 
Ri 0 [(va + Yam te Vans aeahe yi nn 4 Vil, me 2E Val, 147 e Vil, halls 
1 z : 2) 1 
R= Ry,— ee Ry = n| 2. (visas 7B V4i,4i 2 Ys4,33) 7 
ie ek (3.16) 
1 
a. “se oS (vu, mm + Vii, ms . 
1,m,n 
Finally, using (3.15), (3.16), and 
Ri Ray, 
RY — — Ry, 
lm G.17) 
R = Rims 
R'=_R 
as ek ll» 
n (2.19), Einstein’s equations are as follows 
= ptt a (Oi) wise ale 
l,m, n 
Ui Aa” 
e2 uh oo rn n zr Vm wn) Ai 
oom 4 Waa is ae Vam,lm ihe Val, nn Yainn)| ) 
(3.18) 


eat tices a Mas Yna) om a © arena ct Namba) (> 


2) 
‘ Tl = (co Vanes ar (Yanan + Poin — 
- (yaa zi Vin) ni a (Yaa at Van) ae) - 


In accordance with the procedure stated in Section 2 for obtaining the Newtonian 
approximation, the factor —y/e? is cancelled on both sides of equations (3.18) 
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which, with the aid of (2.15), become 
(o + ep) (#4)? — 2p = — #2 Vis eam P inn) > 


m,n 


(¢ + 2p) wut = 5 |’ + Ynn), arr 
i t (i mmir Pal ones ) ae Yan,tn)| : 
(o + ep) uu" = — 5 [(Yaa + Yan),tm + €(Yaima + Yam,ta)> i 
(0 + 2°) (W)8+6 = —4] mm + Yan),sa—— aman + Yanan) — 


a + (yaa a Ve amie 2s (Yaa = Vane) : 


The left hand sides of (3.19) remain finite as e” tends to zero, but without intro- 
ducing additional assumptions about the form of the y,,, the right hand sides 
do not. However, the form of the y,,, given by (3.09) is such that the right hand 
sides of (3.19) do remain finite as ¢? becomes zero. For, substituting (3.09) into 
(3.19) gives 


(0 a &*p) (w*)? foc ep a hae! Vp Pa a » Cae ae EN, , 
, m,n 3 ; 
(0 + 9) Us = rat aie E' (Wm + Yn), 41 +4 (72 Cr — 26,1) > 
(o oi &*p) wu” = (Wa = Wn), 1m ay z (Cima 1% vray) , (3.20) 
(0 + &) (0')? + P=-—YPa- E (Yn =?) aac 
or Giaeaon aie (Pn We) awe (Om — Panne 


It is now possible to set ¢? equal to zero and obtain the following Newtonian equa- 
tions 


C= — VV" ey, 
oU,=YPaits (v2 Cy = 320) , 
eU,U, = (Ys — Pn) tm — & Comact Genta). 
OU +P = =) cat Gagan Cader el ae 
The restriction (3.08) has been removed, for in the present case, 


Q, = z Vy? (ean ra eer 3 (3.22) 


Thus, the non-orthogonal metric 


do? = [1 —n(p + 2ey,) | (dé)? — 


(3.21) 


a Se dei : (3.23) 
Baby 2.0: d8ds4 — D1 + (w+ 26%y)] GE), 
1= =a 


has led to a solution without the restriction (3.08) which was inherent in the 
orthogonal metric (3.04). 


In general (McVITTIE 1956e) TH — 9 (3.24) 


where the semi-colon denotes convariant differentiation defined by 


Tikes DAC) wee lta (3.25) 
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Since terms of order 7? have been neglected, and since, by (3.13) and (3.15), 
both the energy tensor and the Christoffel symbols are of order 7, equations 
(3.24) and (3.25) imply, to this order approximation, 


TH” = 0. (3.26) 


The equation of continuity and the equations of motion of a fluid under the 
force of its pressure gradient alone are contained in the Newtonian approximation 
to equations (3.26) in the following form 


3 
@4+ 2 (eG); =0, 
ba (3.27) 


3 
(e U)at Le UU; + 61; P),; = 0. 
J= 


Thus the expressions for Newtonian density, pressure, and velocity given by 
(3.21) satisfy the equations of motion and continuity (3.27) for any y, y,, and 
¢,, a fact which may be verified by direct substitution. However, since there are 
ten equations in the set (3.21) to determine only five quantities, there must be 
five extra equations, or consistency relations, which are relations which the y, 
y,, and ¢; must satisfy in order that the quantities @, , and U, be consistently 
defined by (3.21). These consistency relations are 


1 m m 
(Wa as Wn), ae Dy (C7, ma ai on 1 4) es aC ee £ 5) 3 .28) 


3 WY 23+ Px)? 
Cail Co ao (Wr Wa),22 + (Y2— Ya),11 + Yast od 


= l Le (Part)? 2 
Coaot+Cs,43 (We Wa),33 + (Ys — Ya),22 + Vi (3.29) 


vs ! | ” (Y,a2T Pe)” 
=Cza3t+41,41 (ps Wa)ar + (Yr Wa),33 + Py , 


where 


3 
P=2V? fr — 2 26, «1. (3.30) 


Equations (3.28) were obtained from the first seven of equations (3.21) and may 
be used to express the quantities (y,—y,) 7, mm terms of yw and ¢;. Thus, the 
density, pressure, and velocity may be expressed in terms of y and ¢, alone, as 
follows 


Org ee 
U, = — pay (Part 9), (3.34) 
p= Vad a 
where X is defined by 
3 
= (p41 t+ 1) (Y aj + %) : 
a eae ae \ —Oie- (3-32) 


Equations (3.29), which were obtained by setting the three expressions for p + 44 
in equations (3.21) equal to each other, are the same as the requirement that 
equation (3.32), which defines X, should be integrable. 
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The hydrodynamic motions obtained above from the non-orthogonal metric, 
(3.23), are more general than those obtained by McVITTIE (1956b) from an 
orthogonal metric, but the treatment is the same. The orthogonal case can be 
obtained from equations (3.31) and (3.32) by setting =O. 


4. Infinitesimal coordinate transformations 
In this section, the loss of generality, if any, due to using a spatially ortho- 
gonal metric will be considered, as will the relationship between orthogonality 
and the restriction (3.08) which makes the momentum-vorticity vanish. These 
questions will be discussed in terms of infinitesimal coordinate transformations. 


The infinitesimal coordinate transformations from (E4, A, £2, £3) to (E45Es exes) 
which will be considered are defined by 


ve Eu ty ft (E4, él £2 £3), (4.01) 
If F is any function of the coordinates, then the difference between F as a function 


of the gu and F as a function of the &’ is at most of order 7. Thus, since the 
metrical tensors here considered have constant zero order terms, the difference 


between them as functions of the Eu and as functions of the & is of order 7?. 
Terms of order 7? are being neglected, so the distinction is not made. In the 
remainder of this section, then, whenever the variables of a function are not 
explicitly stated, they may be taken to be either set of coordinates related to 
each other by a transformation of the form (4.01). Following the work of CurTIS 
(1950), the metric may be written 


do® =@,, dé" dé’, (4.02) 
do? =g,,dé" de’. (4.03) 


The differences between the elements of the metrical tensors g,, and @,, are of 


order 7, so that . 
Sie = Cee aes (4.04) 
Equation (4.03) may be written 


do* = (&,, +7 4,,) (dé + ft, dé) (df +nf',4&) 
= [Bur + (uy + Buchs +80 f%,)] det de. Nie 
Comparing (4.05) with (4.02) gives 
Avg = — Oral ee bye oe (4.06) 
a Ene = 2us Seats ee (4.07) 


Using a transformation of the form (4.01) with the corresponding change in 
metrical tensor given by (4.07), it will be shown firstly how the metric 


do? =g,, dé" d®, (4.08) 
where / 
sa = 14 O44; 
G41 = — EN Og), 
(4.09) 


Si = — &(1+n0,), 


0 — 2 
Sig = —= 8 1 O1ms 
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may be transformed into a spatially orthogonal one, and secondly, how, under 
certain conditions, it may be transformed into an orthogonal metric. In the first 
case, since g;,, is to be zero, the corresponding equations of the set (4.07) are 


0= — 8° 9) Og n ( Soe ef"), (4.10) 
or I) m 
Oim=lhmth- (4.11) 
Let @;,,=©@,,; be any function such that 
O1m = Om, ims (4.1 2) 
then, a solution of (4.11) is 
jie - (Mim | Win ee Nee (4.13) 
If /* is chosen so that 
(=O. (4.14) 


then the transformation given by (4.13) and (4.14) yields the following spatially 
orthogonal metrical tensor 


Ssa = 117044, 
S41 = —enlout > (nn —@1m— rn) a1), 
G1 = — & [1 +9 (01+ Onn — Oim — in), 11) |, ae 
Sim = O. 
The metrical tensor (4.15) is of the same form as (3.10). In fact 
Yaa = Fa, 
Yat = 941+ — (Onn — Om — On), 41 (4.16) 


2 
Vi1 =O + CORR 1p), 11+ 


Thus, it has been shown that the spatially orthogonal metrical tensor (3.10) 
may be obtained from the more general metrical tensor (4.09) by an infinitesimal 
coordinate transformation. This transformation leaves the approximate form (to 
order 7) of Einstein’s equations (3.18) unchanged. 

In the second case, that of transforming (4.09) into an orthogonal metrical 
tensor, g,, as well as g;,, is to be zero. Now, in addition to equations (4.10), 
the following equations must hold, 


OS Se A ee (4.17) 
= fip=efa— ey). (4.18) 
Differentiating (4.18) with respect to em gives 
Ka=e fg EO4l,m> (4.19) 
and reversing indices gives 
Pai = 6 Pia — €04 m, 1° (4.20) 


The sum of (4.19) and (4.20) is 


On = emt 4; os heayis are E( Oat ats 4 m,1) Fz Salar. me ENC aihn, a 4m, 1) G (4.21) 
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If the functions o,,, are such that the three equations (4.21) define consistently 
a function /, then functions /’ can be found from (4.18) or 


ee fig =e 04+ fi. (4.22) 


In order to simplify the notation, it will be assumed that the transformation to 
a spatially orthogonal metrical tensor (which can be made without loss of gener- 
ality) has already been made. That is, o;,,=0. It will next be shown that a 
necessary and sufficient condition on the functions 04; to insure the consistency 
of the equations (4.21) is 

O41,mn — %4m, In: (4.23) 


The necessity of the condition follows from 


Cran = bea (4.24) 
since, from (4.21) with o;,,=0, this is equivalent to 
Ost nm + Can, tm = Cam, nt 1 Can, mi- (4.25) 
If, on the other hand, the condition (4.23) holds, then equations (4.25) are 
satisfied. Let F’ be any three functions such that 


Be ae: Ze (64 m,n + O4 oe) > (4.26) 
Equations (4.25) imply 
Five P gay a tas: (4.27) 
or 


Fix F*+ A,(&,&, &) + BS, 8,8) +0, 8,28), 
FLaFS Ay(E1G2 2) Bache) aie ceca 
Let three functions G’ be defined as follows 
Gus FE BC. 
G=F*+A4,+C,—C,, (4.29) 
G§ = F3+ A,+ B,— B,, 


then it is easily verified that 
Gisn Sz. Vs ie ae 3 € (4 m,n a O4n, mal , (4.30) 
and 
G=G=G, (4.314) 


If f* is taken to be the function which is equal to each of the three functions 
G', then the sufficiency of the condition is established. 

In Section 3 it was shown that the orthogonal metric (3.04) led to the vanish- 
ing of the momentum-vorticity. It was also shown that the non-orthogonal 
metric (3.23) removed that restriction on the motion. However, the condition 
(4.23) for the existence of an infinitesimal coordinate transformation which 
transforms the metrical tensor (3.10) into an orthogonal one is neither necessary 
nor sufficient for the vanishing of the momentum-vorticity. To show this two 
examples will be used. The first will be a case where the metrical tensor can be 
transformed into an orthogonal one and the momentum-vorticity is non-zero, 
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and the second example will be one where the metrical tensor cannot be so trans- 
formed but the momentum-vorticity does vanish. 


The condition (4.23), in the case of the metrical tensor (3.10) is 
Can Solas (4.32) 


For the sake of an example, consider the case where only one of the ¢,, say ¢,, 
is different from zero. The condition (4.32) becomes 


C1230. (4.33) 


Following the procedure indicated above, a function @,,=@4, is chosen such 
that both 


My4a14 =, (4.34) 
and 
14,230. (4.35) 
Then, the transformation defined by 
4 & 
f= 2 Wy4,4> 
P= 3 Osa, 
2 = (4.36) 
P= 2 %14,2> 
P= 214 3) 
transforms the metric (3.10), with €¢,=¢€,;—0, into the orthogonal metric 
4 
do? =) 81, (ae"")?, (4.37) 
Z—E 
where 
gaa =1—n(p + 2ey,— E'Wy444)> 
i er ny 26? py, — @y4,11) |, 
bhgeiiirin (4.38) 
Soy ks [1 tay + 2é Wo + W429) |; 
33= e7|4 t 7 (wy 2 €? ps + W433) |- 


The first order terms of Einstein’s equations have not been affected by the 
above transformation, nor has the Newtonian approximation. Even though the 
metrical tensor is orthogonal, there are two nonvanishing terms of the momentum- 
vorticity, namely 

(e Uy),2— (e U2) 1 = V7 CoH 27 Wy4 142, 


dw io Aino siti (4.39) 
(o Uj) 3 os: (0 Us) 4 =3V Cis =3V Wy44,143- 
For the second example, take 
by a ise 7 a 
(4.40) 
Ce = Cs =U 


In this case, since C, 53-0, the transformation to an orthogonal metric cannot 
be made, but the momentum-vorticity clearly vanishes. Thus there is no intrinsic 
connection between the orthogonality of the metric and the vanishing of the 
momentum-vorticity. 
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5. On an approximation due to Einstein & Infeld 
EmnstEIN & INFELD (1949) have developed a method of successive approxima- 
tions in terms of what they call an arbitrary small parameter 4. The purpose 
of the present section is to investigate the significance of A in terms of the ap- 
proximations used in previous sections of this work. 
The metric (2.05) may be written in terms of general relativity units, for 
comparison with the work of ErnsTEIN & INFELD, as follows 


[d(cto)]*= (1+ raa) [4(c 2&4) ]? — 2 Din yacd LE) d(c ts) — 
ae . (5.01) 
= + vii) [4 (LE). 


EINSTEIN & INFELD used the metric (in general relativity units) 


3 . . . 
ds? = (1 + Igo) (4x)? +2 Dig, dx dx' — dD (6,,—h,;)dx' dx’. (5.02) 
i=1 i,j 
If the two metrics (5.01) and (5.02) are to describe the same space, it must be 
the case that 


CUG =, 
Chica 
l& =x", 
5.0 
NYsa=M00; ee 
— 194; = Noi, 
aie O33 Vii = h;;. 


The parameter A is introduced as a scale factor by the introduction of an 
auxiliary time t, which, in terms of the time coordinate x°® of (5.02), 1s 


TA, (5.04) 


All computations are carried out in terms of the coordinates (t, x1, x2, x) rather 
than with (x®, x1, x?, x8). The reason EINSTEIN & INFELD give for doing so is 
that derivatives of functions with respect to t can then be treated on the same 
footing as the space derivatives. In Section 3 of the present work, all calculations 
were carried out in terms of the coordinates (&4, &1, £2, £8) and all derivatives 
were assumed to be of the same order of magnitude. Thus the systems (r, x1, «2, x3) 
and (&, &, & &) can be identified because all derivatives are treated on the 
same footing in both systems. Now since /&'=.«', it must also be the case that 


A (5.05) 
Combining (5.04) with (5.05) and ct&4 = x° gives 


Peery ee ee 
A a Ghee ee (5.06) 


c 


Thus the coordinate system used in Section3 forces the identification of J with e. 
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EINSTEIN & INFELD expand their /,, in powers of A as follows 
3 
Noo = I?(3 Yoo) +4(4 Yoot? 2d 5) iam 
2 4 s=14 


hoi = BY0; ® Voi press (5.07) 


3 
h;; =H (3 0:; Yoo) =i As| Mapa = 0;; YS Ver = 5,; Yoo) alee ada 
2 4 s=14 4 
where the above y,,, are not to be confused with the y,, used heretofore in this 


work. Combining (5.03) with (5.07) gives 


3 
1 Yaa = (3 Yoo) a(3 Yott Dd Yes) bo 
2 4 Sa ed 
PRY as taNoagind Kowmaiy (5.08) 
5 


3 
— 1755 = BE Yoo) +I(ys—4 > Yes + #Y00) + ii: 


=14 4 


In Section 3, the y,, were expanded in powers of « in the following way: 


Vis = Pee i.) , 
Vai = 86, (5.09) 
Vie = Vit 2E7y,. 

Substituting from (5.09) into (5.08) gives 


3 
= np — 21 & p= I (5 Yoo) + 18 ($7004 ED yu) te 
ae l = A oy oe, (5.10) 
3 
3 
— ny — 2m p= I? (B00) + (ys FY00— FD Yas) to 
2 4 4 s=14 


Equations (5.10) are consistent with the identification Ae together with 


| 
i Seonkine 2 foo 
SSu (5.11) 


3 
BR ly nye Toe eae 
2 62 Yi Yer 2 Yoo 2 dps 


Thus the method used in Section 3 would appear to be equivalent to working 
to the order A by the method of EINSTEIN & INFELD. 

The identification 2 = ¢ was dictated by the choice of coordinate system. The 
parameter / is said by ErnstE1n & InFELD to be arbitrary, which would suggest 
that other identifications should be possible. At first glance, it would seem 
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possible to identify A with 7%, but that cannot be done consistently without 
leading to the equality of ¢? and , which, in turn, leads back to the original 
identification (5.06). Thus, the parameter A is only as arbitrary as ¢ is, which 
means that it can be varied only so far as the ratio v of the scale factors / and ¢ 
can be. 

6. Notation 


o dimensionless length of arc. 
g,» dimensionless metrical tensor. 
él dimensionless coordinates. 


E small dimensionless parameter inversely proportional to velocity of light. 
7] small dimensionless parameter proportional to constant of gravitation. 
R,, dimensionless Ricci tensor. 


T"” dimensionless energy tensor. 

Yur» W Yu» S, functions of coordinates used in expansion of metrical tensor in 
terms of small parameters. 

0 dimensionless density. 

p dimensionless pressure. 

U, dimensionless Newtonian velocity. 

Q, momentum-vorticity. 


Greek indices range over the values 4, 1, 2, 3 and Latin indices over the values 
1,2,3 only. The indices /, m,” always denote a cyclic permutation of 1, 2, 3. 
A comma denotes partial differentiation with respect to the coordinate(s) indi- 
cated by the subscript(s) after the comma. 

Other symbols are used in one section only and are explained where they 
are used. 


This work is based upon a dissertation submitted by JupITH BLANKFIELD in 
partial fulfillment of the requirements for the Ph. D. degree at the University of 
Illinois. 
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The Determination 
of the Inverse Matrix for a Basic Reference Equation 
for the Theory of Hydrodynamic Stability 


ALBERT L. RABENSTEIN 


Communicated by C. C. LIN 


1. Introduction 


In the development of an asymptotic theory for the Orr-Sommerfeld equation 
of hydrodynamic stability, the equation [7] 


(1.1) wi? + (zu +au' + Bu) =0 


plays the role of a reference equation. This equation is equivalent to the first 
order system 
d 


(1.2) qe ole A) U 
where A(z, A) is the matrix 
O 1 05,10 
O O A O 
B Ag (2,4) = 
(1.3) 0 (2, A) ee ei 
—p —« —Az 0 


If u;, 7=1, 2,3, 4, are the components of a column vector of a matrix solution 
U(z, A) of (1.2), these components are related to a solution uw of (1.1) by the 
equations 


ver 


(1.4) Ho Us Ue Ug = AW Uy A 
For every fundamental matrix solution U of (1.2), the matrix (U~)* is a fun- 


damental matrix solution of the adjoint system 


d 
(1.5) ee eA 
The adjoint of (1.1) is 


(1.6) vi? + A2[zv" + (2 — a) v’ + Bu] =0. 


This has the same form as (1.1), the only difference being that « is replaced 
by (2—«). Hence the asymptotic forms of its solutions, for large values of | A], 
are known from [J]. The components v;, 1=1, 2,3, 4, of a vector solution of 
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(1.5) are related to a solution v of (1.6) by the equations 


v,=(4—1)u—20 —1? 0", 


Vg=ZU+A?v", 
1.7 
( ) tae Aye 

0, =U. 


In particular, the elements in the last column of the matrix U™, corresponding 
to a fundamental matrix U for (1.2), are solutions of (1.6). 


The hydrodynamical equation [/] is of the form 


2 2 
(1.8) Ge A yy pee 2 On Aygo? = 0. 
n=0 n= 


From any given solution of the reference equation (1.1) may be constructed a 
formal expression which can be shown to represent a true solution of (1.8) 
asymptotically in a certain region of the complex y-plane. The details of this 
analysis will be presented in another paper. However, in order to carry out the 
proof of the above statement, it turns out to be necessary to know the asymptotic 
behavior, for large |A|, of the inverse matrices corresponding to various funda- 
mental matrix solutions of (1.2). 

In this article we shall derive formulae which will enable us to express the 
elements of the inverse matrix U™, corresponding to any given fundamental 
matrix U of (1.2), in terms of specific solutions of (1.6). The asymptotic behavior 
of these elements will then be known from [7]. Furthermore, this explicit know- 
ledge of the elements of U in terms of the solutions of (1.6) makes it convenient 
for us to write the asymptotic expansions of the elements in different forms for 
different sectors of the complex z-plane. This can be done by using the relations 
(2.6) of [7], with « replaced by (2—«). This turns out to be necessary in the 
analysis of (1.8). 

The principal results of this article are the formulae given in Section 4. 
These express the third order Wronskians of the solutions of (1.1) in terms of 
the solutions of (1.6). The use of these formulae may be illustrated by the 
following example. Let U be the fundamental matrix 


By(z,«) By(z,«)  Ba(z,«) Bs (z, a) 
Bo Bi Bs Bi 
ABS eRe i elsk Ba Gees 
A* By ABNBYS CNR ED AT We fea 


(1.9) Wee 


Here the functions B,(z,«), 7=0, 1, 2,3, are specific solutions of (1.1) defined 
in [1]. Suppose it is desired to know the asymptotic behavior of the elements 
of the inverse matrix U1. By direct calculation we find that 


W(B,, By, Bs) 

: 2 <hr ih 1) Wa Benbabs) 

(1.10 LOSS es ee 0 P2, D3 
) W(By, By, Bz, Bs) W(B,, By, Be) 

af oe a Wiese eres) 
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By the use of formulae (4.32)—(4.35) at the end of this article, it follows that 


erin BD. 


2(%, 2 — a) 
n aie pee y2ut2 se i ae A; (2 9 ae a) 
ei. AU ee ; 
WEB es 2 istesoeat) 
a e*r*t A, (2) 2'— a) 


The first three columns in the matrix here may be determined by means of the 
relations (1.7), although for the analysis of (1.8) these are not needed. The 
Wronskian W(By, B,, By, B;) is independent of z, since (1.1) has no third deri- 
vative term. It may be evaluated by the use of the formulae derived in Section 2, 
which give the behavior of the solutions for large values of |z|. It is found that 


(led lea) WB Bebe, De 4ar0 Bho 


The inverse of any other fundamental matrix may be found in the same manner 
by the use of the appropriate formulae in Section 4. 


2. The asymptotic properties of the solutions for large values of |z| 

The known behavior of the solutions of (1.1) for large values of |A|, while 
sufficing to determine the asymptotic properties of the elements of the inverse 
matrix, are not sufficient to establish exact formulae expressing these elements 
in terms of the solutions of (1.6). It is true that only the asymptotic behavior, 
for large |A|, of these elements is needed in the study of (1.8). However, for the 
sake of a mathematically complete discussion of the equation (1.1), we shall 
prove the exactitude of the relations listed in Section 4. Furthermore, as men- 
tioned in the Introduction, these relations greatly facilitate the writing of the 
asymptotic expressions for the elements of the inverse matrix in their most 
useful forms. In order to prove the relations, we need the leading terms of the 
asymptotic forms of the solutions of (1.1) and (1.6) for the case when A is fixed 
and the independent variable z is large in magnitude. These will be derived in 
this section. The establishment of the relations (4.1) —(4.35) will be carried out 
in Section 3. 

(i) The functions A,. 

In Section 4 of [/], the functions A, were expressed in terms of the variable 
&=27)2'. They were then evaluated by the method of steepest descents, the 
asymptotic forms (4.21) being valid for large values of |&|. In [1], z was restricted 
to a bounded domain and A was large in magnitude. However, in the case where 
|E| becomes large with A fixed (and different from zero) and |z| large, the method 
of steepest descents still applies. In this case, an examination of the calculations 
involved shows that the error term O(€+) must be replaced by the term O(|z|~#). 
Thus we have 


(2.4) Ag (z, 0) = 4 Yoo(—1)ReWBaiGate) gerd hand “B48 [4 4 O(\2|-4)] 
with 
(2.1 a) 4n(k — 2) —2argA<argz<2n+$a(k — 2) — arg, 


for zin S—C,. 
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(ii) The function By. 


From Section 3 of [7] we have 


(2.2) By = ft exp(zt — Bt) exp(g AA) dt. 
C (Bo) 
Let 
t=Prz-4r, C= 2p2. 
Then 
QB tal ee Avy 1 Ling apy 
(2.3) By =(7F) [r exp/Ee(r——| exp oe. clea) dr. 


The path of integration in the t-plane emerges from the origin with direction 
arg t=arg¢—2z, circles the origin, and tends to zero again with direction 


are — atc. 


For the purpose of evaluating By for large |¢|, we write 


15520 (SR \ ral ee Esra 
(2.4) exp|4 4-+(20) | 140% 7h(z,6,a), 
where h(t, ¢, 4) is uniformly bounded for fixed A==0, |¢|=|9|>0, and |t| <| 7]. 
Then 
ey TS a—2 ow 1 1 
B= (205) f ertexp[Et(e— Aart 
(2:5) C’(Bo) 


+73 i tT exp 


C’(Bo) 


For the first integral here we have ([2], p- 199) 


Sele ‘)]acet.aar}. 


(2.6) litaatexp ; g(r t))dr =—2nie** E40), 
where ait 
(ae Ee) £ (3) ie —}na+}n) [1+ O(|C[)] + eM 472449 144 O/E|->)T 


for |¢| large and for |arg¢|<wa. 


There remains the task of estimating the second integral in (2.5). For this 
purpose we write 


(2.8) [ ePexp|5e(r— 2) hte, ade = hh. 


C’(By) 


The integral J;, 7 =1, 2, corresponds to the path C(I ;) Shown in Figure 1. The 
function exp [¢/(t)], where f(z) =F(r-+}, has saddle points at t=-+7. The 
path of steepest descent through t=17 leaves the origin with direction arg t= 
arg¢—2a and goes to infinity with direction arg t=—am—argl for —in< 
arg¢< a. The path of steepest descent through t = —1 leaves the origin with 


direction arg t=arg¢ and goes to infinity with direction arg 7=—a—arg¢ 
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for —3a<arg¢<432. Suppose for the moment we take |arg¢|<42 and con- 
sider J,. Let C(J,) follow the path of steepest descent through t=7 to a point 
where ¢[/(t) —f(z)] =¢ f(t) —7€ is real and negative, and then let it be joined 
to C(I,) by a part along which Re {€ [f(z) — /(2)]} is constant and negative. Thus 
if | arg ¢|<— 6, the path C(J,) can be chosen so that e~‘*J, is uniformly bounded 
for |¢|=|Co|>0. Similarly, with C(Z,) being chosen to follow the path of steepest 
descent through t= —7, e'*I, is uniformly bounded for |arg¢|<a—6. 


org t = 0rg 


T -plane 


There now follows from (2.5), (2.6), and (2.7) that 


By=— i remiap (PY {ee dnottf4 + OIC FN] + 
(2.9) 4 e-iS—dt otha) (4 +0(\C|>)}}, 


where | arg¢|<<, or 

— : : Sea Anz = 
oeuty BBG eae oes es AR ells 
5 ze ei (287 2?—-42a+32) [1 + O(|z|-4)]}, 


where | arg Bz|<22. 

(iii) The function B,. 

We shall discuss in detail the solution 6,. The behavior of B, and B, can 
be found in an entirely similar manner. From [/] we have 
(241) B, = f exp (zt — Bt) exp(41? 8) dt. 

C(B,) 

As mentioned in Section 5 of [1], it is possible to choose the path of integration 
so that Re (tz) <0 and Re(A-?#8) <0 for large |¢|, provided that z lies in the sector 
S— S,. Now let 


(2.12) SL eis Ree Sea ALLE 
where 
(2.13) —2argA<argz< $n — arg. 
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Then 
(2.14) = (77) a— ie *exp[o( — =) exp <A 2( 223] dz 
or hae 
eee ies | ctexp E ¢(r—1))ac+ 
(2.15) : C’(B,) 


+09 fettexn[ Lee —4)] nee aad, 


C'(B;) 

where h(t, ¢, 2) is bounded on the path of integration for |¢|=|¢ | >0 and for 
fixed A with A= 0. 

Here arg lies in the range 
(2.16) larg fp —targA<arg6<$a-+ fargB — garg. 
Also, for large |t|, we have 
(247) da<argt+argl<ia 
while near tT =0, 
(2.18) —tia<argt—arg6 <4. 
In [1] we restricted arg B to the range 
(2.19) 2argASargB< 22+ Zarga. 
If arg B is allowed to vary through this range, it follows from (2.16) that 
(2.20) O<arg6<3a 


In the appraisal of the second integral in (2.15), the restriction that arg¢ he 
in the range (—z, 22) is necessary, and this is the case when (2.20) holds. 
For the first integral in (2.15) we have 


220 f exp | (x ~|\|dv = at HD, (C), 
C’'(B;) a 

where 

(2.22) H® , (6) ae =a ei (E+4 ma—$ x) [1+ 0((C|)] 


([2], p. 197). This last relation is valid for argé in the range (—a, 2), and 
hence for arg¢ in the range (2.16). 


Next let us consider 


(2:23) ee aren eek alee Geode 


C(B,) 
where /(t) = > (t+). For arg¢ in the range (2.16), the path C’(B,) can be 


taken so as to go to infinity in a sector where Re [¢ /(t)]|<0, and where h(z, ¢, A) 
is bounded. The function f(t) has a saddle point at t=7. The path of steepest 
descent through + =7 for the function exp [¢ f(t)| tends to zero with direction 
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arg t=arg¢ and goes to infinity with direction argt=a—arg¢ for —in< 
arg¢< 32. We may therefore choose the path of integration in (2.23) as follows. 
Starting at the origin, we follow the path of steepest descent through t =i to 
a point where ¢ (f(r) —/(z)] is real and negative. We then take a circular arc |t| = 
constant into the sector where /(t,¢, A) is bounded, and then a ray going to 
infinity in the sector where Re [¢ /(t) —7¢] is less than zero and h(r, &, 4) is boun- 
ded. On the finite part of the path which follows the path of steepest descent 
and the circular arc, e~** e°/(”) will be uniformly bounded for |¢|=|fg|>0 and 
—nz+0Sarg6<22a—0. On the ray extending to infinity, ¢ f(t) —7¢ will have 
a negative real part and A(t, €, A) will be bounded for arg¢ in the range (2.16). 
Then for arg ¢ in any closed subinterval of (2.16), and for |¢|=|Cy|>0, the func- 
tion in (2.23) will be uniformly bounded. 


It follows now from (2.15), (2.21), and (2.22) that 


(2.24) Be? |B (2F\* eil(C+haa—fn) [4 WO UC |)] 
where 

(2.24a) Zz argB —targhA<arg6<2a+4argp —targi, 
or 

(2.25) B, =i|/n pie A-ha gi(26A+ haha) (4 4 O(\2|-4)], 
where 

(2.25 a) — ZargdA<argz<4a— argh. 


The results for B, and B, are 


(2.26) B,=i |x Bie? Ade oi (287 27+ $na—}n) [4+ O(\z|-4)], 
(2.27) Be at | etnim plead habe (eRe dab) 4 1 O(\z|-4) | 
where 


(2.28) —2nx—2a(k — 2) —Zargh<argz<2a— 3m (k — 2) — Zargd, 


for zin S—S,. 


3. Derivation of the Wronskian relationships 


As was shown in the introduction, the elements of the inverse of any fun- 
damental matrix for (1.1) may be expressed in terms of specific solutions of (1.6) 
by the use of the formulae listed in Section 4. These formulae express the third 
order Wronskians of solutions of (1.1) in terms of the solutions of (1.6). In order 
to establish the validity of these formulae, we first show that the relations (4.1), 
(4.7), (4.8), and (4.12) hold. Since the four solutions involved in these relations 
constitute a fundamental set for (1.6), the other formulae may be obtained 
by expressing the other Wronskians in terms of these four by means of the 
relations 


B, papa Bs, 
(3.1) Bz+ A,=B,+ By oe eet Doll 
By =F A, Fa By, 


which were derived in [J]. 
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Consider first the relation (4.7). Taking first z in S—S, with —Zargi< 
arg z<4m—2arg A, and using the results of Section 2, we have 


1 1 B, (2, %) 
W(Ay, Ag, By) (2,0) =a Be 2-8] iad iat By (z,o)|[1+O(l-4] 

(3.2) —ftz —/*z Bi, a) 
ae och pix—4 12% zha-$ ei (2A? 2!+ $0—42) [1 +0 (\z|74)]. 


Comparison with (2.25), with « replaced by (2—«) there, shows that 
(3.3) g-datte-2iet yd, Ay, By) — hy (A) Bi(z, 2—9)] =O(lal-4), 


where 


(3.4) kid) =—2are op A 

For the case when 2 lies in the interior of the sector S,, by use of (3.1) we write 
(3.5) W(45, Ag, By) =W(Ag, Bo, Bo) — W(A,, As, By). 

Proceding as before with the Wronskians on the right here, we find that 

(3.6) W(A3, By, By) = — 'y(A) Ag (z, 2 — ) [14 + O(l2l~4)] 

and 

(3.7) W(Ay, As, Bs) = — hy (A) Ba(z, 2 —«) [1+ O(la|~4)]. 

Since B,= B,— Ag, it follows that 

(3.8) gett eh (W(As, Az, By) — hy(A) By(z, 2—a)] =O((a|-4). 


Here &, is the branch of & = 2128 which has a positive real part in S,. 


To prove (4.7), we observe that since the expression in brackets in (3.3) 
and (3.8) is a solution of (1.6), we may write it as a linear combination of a set 
of independent solutions of (1.6). Thus 
(3.9) W(Ag, As, By) — ky (A) By (z, 2 — a) 

= Cy Bo(z, 2 — a) + B,(z, 2 — a) + cy Ag(z, 2 — a) + €3A5(z, 2 — @) 


for some set of quantities c; which may depend on A but not on z. We shall show 
(iat C0. 4OL 7 Ont. 2,4, 

In (3.9), if we multiply through by z!%*4e~* and let z tend to infinity along 
a ray in So, the left-hand side tends to zero because of (3.3) and the fact that 
&, has a positive real part in S,. The right-hand side tends to 

i oe es tia—jni Ja-s ae 

Hence we must have c,=0. Similarly, if we multiply through in (3.9) by 2*+4e—§: 
and let z tend to infinity along a ray in S;, we find that é,=0. Then 
(3-10) W(As, 4s, By) — by(A) By (z, 2 — 0) = cy Bo(z, 2— a) +, By(z, 2-2). 
In (3.10) multiplying through by z)*tte~, letting z tend to infinity along a 
ray in S,, and using (3.8), we find that c,=0. Hence 


(3.14) W(Az, Ag, By) — hy (A) By(z, 2— a) = cy By(z, 2 —2). 
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If we multiply through in (3.11) by z#-!*e~2#é"#* and let z tend to infinity 
along a ray in S—S,, the left-hand side tends to zero by virtue of (3.3). The 
right-hand side has the form 


(3.12) i |loce™'*Pt-¥%cfeb™ia—-Emi[4 4 O (z|—)] ei APs + bna—de) [1+ 0 (|z|~4)]}. 


If the exponential function in (3.12) becomes large in magnitude as |z| grows 
large, the quantity (3.12) tends to infinity unless cy=0. On the other hand, if 
the exponential function tends to zero, the quantity (3.12) tends to 


1 |x Cer katte ent, 
Again, we must have cy=0. Since it is always possible to choose the ray in 


S—S, so that one of these two cases holds, we have cy=0. This proves (4.7). 


The relations (4.8) and (4.12) can be proved in the same manner as (4.7). 
Relations of the forms (3.3) and (3.8), with permutations of the subscripts, can 
be derived. With the assumption of relations of the type (3.9) and the appropriate 
changes of subscripts, it is possible to procede as in the case above. 


To prove (4.1), we consider the behavior of W(A,, A, As) for large values 
of |z|. By use of (3.1), we may write 


(3.13) W(Ay, Ao, As) = W(A,, Ap, Bo) + (?"**— 1) W(Aj, Ag, Bs). 


Then for z in S—S;, with —$a—2 arg A<argz< — arg A, we find that the 
right member of (3.13) has the form 


— 2nf pie-4 fea gad {of (26824 +4na—$2) [4 +O (\2|-4)]+ 


3.14 Seenin 
( ) 4 ei(288 b+ 5 nahn) [4 +0 (\z|-4)]}. 


Comparison with (2.10), with « replaced by (2—a), shows that 
z-8e+2/W(A,, Ap, As) — ho(A) Bo(z, 2 — «)] 


oA ue en: 
pa = ital oct) + etl O(leI-8), 
where 
(3.16) Rg (A). = Dod a Be fee 


The Wronskian W(A,, A, A3) may be evaluated by similar procedures for z in 
the sectors S— S, and S—S,, and it is found that (3.15) actually holds for all 


values of arg z. 
Since the quantity in brackets in (3.15) is a solution of (1.6), we may write 


W(A,, Ag, As) — Ro(A) Bolz, 2 — &) 


7 
ae ly Bale 2 oe, Bi (27 2a) 0, Bz, 2 — am) + 6, Baz, 2 — a). 


In (3.17) we multiply through by 2!*t4e~* and let z tend to infinity along a 
ray in S,, then the left-hand side tends to zero, while the right-hand side tends to 


1 7p eka imi Jae on 
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Hence c;=0. Similarly, if we multiply through by. 2°Fae 4 25 Mandiletaz 
tend to infinity along a ray in S,, we find that c;=0, for7=2,3. Then 


(3.18) a). 


It is always possible to find a ray in S along which e becomes large in 


magnitude. If we multiply through in (3.18) by zi-4% @ and let z tend to 
infinity along this ray, the left-hand side tends to zero, by virtue of (3.15), while 
the right-hand side tends to 

1 |ebtia— tnt ph he cy, 


— ky (A) Belz, 2 —«) =% Belz, 2— 


pie 
21 B? 2? 


W(A,jA5) As) 


21 p% 22 


Hence cy=0. This proves (4.1). 
We have shown so far in this section that 


WA Ags As) (QoS 2 er BP ee 
Ah W(A,, As, Bi) 20) = — 20 2 Be 7 ae 
us W(Ay, Aap Ba (Oa) = 20th Be el 
) (2,0) = 


W(A,, Ag, Bs) (z, « ke sas 122% Ba(z,2—«). 


Since the functions on the right constitute a set of linearly independent solutions 
for (1.6), we can determine any other third order Wronskian by the use of these 


formulae and the relations (3.1). 


may be determined as follows. 


W(A3, B,, Bo) 


= W(A;, By, By As) 


For example, the Wronskian W(A,, B,, By) 
Using (3.1), we have that 


= W(A;, B,, A.) +W 
= W(Ag, Ag, B,) + 
Then from (3.19) it follows that 


(A;, B 
W(A,,A 
W(Ag, By, By) = 2204 e*** Be 724 |B, (2 


or 


(3.20) WA;, By Bo) = 2c BAe A aa) 


4. Formulae for the determination of the inverse matrix 
Below is given a complete list of formulae which express the third order 
Wronskians of the solutions of (1.1) in terms of specific solutions of (1.6). From 


these formulae may be calculated the elements of the inverse matrix U~ cor- 
responding to every fundamental matrix solution U of (4.2). 


(AN W( As A Ag) ie ea * 12° By(z,2—a), 

(4.2) W(Ay, Ag, By) =2mi e-** Be 924 [(1 0-22!) B, (2, 2a) — By(z, 2—)], 
(4.3) W(Ae, 4g, By) = 206 e-7!2 ge ae 1(2,2—a) —Bo(z, 2—a)], 
(4.4) W(A,,A3, Bo) =2m1 6-2 Bo 2% | (4 — e240) Be (z,2—a)+Bo(z,2—a)], 
(4.5) W(Ay, Ag, By) =20t eB "1 Bo(z, 2a) — Baz, 2a), 
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W(Ay, Ay, By) = —2mie-*i* Be! 22" B,(z, 2 — 0), 
VE CA PAGM Bea Dag ¢ ok BAN Bt BR. (22 = a)) 
WiAD, Axe Biya et ep? BG; 2) ; 
W (Ag, Ag, Ba) =— 2m i c™ f° 12% B, (z, 2 —a), 
W(Az, Ag, Bg) = — 2mie7***B*-? 72% B, (2, 2 — a) + By(z,2—a)], 
W(A,, Ay, By) = 2m i c™** fp 2B, (2, 2— a), 
VWAAi. A, Bs) = 220 ee pe A BalZ, 2a), 
W(A, A, Bs) =2ni2-2 8 1 By (e222), 
VAAL WB Be\i08 
W(A,, B,, Bs) = — 26 e-7** fe! J2* B, (2,2 —2), 
W(A,, B,, By) =— 206 ct'* Bp" 92" B, (2, 2 —2), 
W (A>, By, By) = 2m e-*** p92" A, (z, 2 — a), 
W(Az, Bs, Bs) =—2ate-" *p" 11+ B,(z,2 —«), 
WAS By, Bo) 20046" Bo LA B22 =a), 
Wide By Bio; 
W (Ag, By, Bs) =20i e**B* 2B, (z,2 —a), 
W (Az, By, Bs) =201e* f°! 2*B,(z, 2 — a), 
W(A,, By, By) =—22ie~™'* Bt 124 | Ba (z,2—cx) —B,(z, 2—a) + By(z, 2—«) J, 
W(A,, Bo, By) = 2706 e-*** f° 2A, (2, 2 — a), 
VACA, in eG pi 2704 Bre AP TA a2 a), 
Wee Be B21 ce OB A AS Go — 
WAs Bor Bo) Sout eRe Pee B, (232 4) = Bale 2:=o) | 
W(Aq, By, By) =2mie-*'* p> 2* A, (z, 2 —a), 
WiASB Boe 200) oF Bae e A? A, (2.20), 
Wide B.. By) saver BAL, 2 a), 
WiAg Bg, Bo) 2016" Beh Pt | PF" Boz, 20) — 
— B,(z,2—«) — By(z, 2—«)], 
WiBe Ba, Bs) 220 6s) A B32, 2— 4), 
W(Bo, By, Bs) =— 201 67% Bt} 24 Ay (z, 2 — a), 
VA Beeb) = — Qrete ie Bs OA, (2 2), 
W(Bo, By, Bs) =2i e777" 2A, (2,2 —a). 
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In ehlerabschatzungen 
bei gewohnlichen und partiellen Differentialgleichungen 


JOHANN SCHRODER 


Vorgelegt von L. COLLATZ 


Es werden Fehlerabschatzungen fiir Funktionen w*(x) hergeleitet, welche in 
einem beschrankten Gebiet 8 einer Differentialgleichung 


M[u] =f (x, u) 
gentigen und auf dessen Rand gegebene Zusatzbedingungen 
Ulul=yi  (@ =4,2,...) 


(Rand-, Anfangs- oder andere Bedingungen) erfiillen. x reprasentiert # unab- 
hangige Verdnderliche, M und die U, bedeuten formale lineare, homogene Dif- 
ferentialoperatoren, zu denen es eine ,,Greensche Funktion‘ G(x, €) mit gewissen 
Eigenschaften gibt. Alle vorkommenden GréBen seien reell!. Die bewiesenen 
Satze enthalten auch Existenzaussagen. 

Die Naherung, deren Giite abgeschatzt werden soll, wird ermittelt, indem man 
einen oder mehrere Schritte des Iterationsverfahrens 


M[tn+1] = f(x, Un) » U;[Uy +41] Kes (p= 4,275 ..; n ='0, 1,2, .::) (0.4) 


durchfiihrt. Praktisch kommt man oft mit einem Iterationsschritt aus. Man 
beginnt dazu mit einer auf irgendeine Art erhaltenen moglichst guten Naherung 
U,(x) (Ss. Beispiel 1) oder versieht die Ausgangsnaherung u(x) mit Parametern, 
die man dann so bestimmt, daB wu, (x) sich von u(x) in geeignetem Sinne médg- 
lichst wenig unterscheidet (Beispiel 2). Oft ist es auch méglich, u(x) ,,riickwarts‘ 
von u,(x) ausgehend zu berechnen. Man braucht dann keine Differentialgleichung 
zu losen (s. Nr. 2.4 und Beispiel 3). 

Die Fehlerschranke o(x) fiir |w*(«) —u,(x)|, u(x”) bezeichnet hier die letzte 
nach (0.1) berechnete Naherung, erhalt man im Falle G(x, &)=20 als Lésung 
einer ,, Vergleichsaufgabe‘*? 


M{o|=f (x, a(x) + 0,(x)) mit | #1 (x) — uo(x)| So, (x), (0.2) 
Ulo]=0 (@=1,2,..), (0.3) 
1 Die Ergebnisse lassen sich z.T. auf komplexwertige Differentialgleichungen 


iibertragen. 

2 Die Beziehungen (2.9), (2.10) sind von etwas allgemeinerer Form, sie enthalten — 
ebenso wie (1.16) — noch eine Funktion @)(¥). Gewohnlich kann man jedoch @)(«) = 0 
setzen. 
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in der also statt einer Differentialgleichung eine Differentialungleichung vor- 
kommt. Auch die Zusatzbedingungen (0.3) lassen sich in vielen Fallen durch 
Ungleichungen ersetzen (s. die Nrn. 3.2 und 4.2). Nimmt G(x, €) auch negative 
Werte an, muB man an Stelle der Beziehungen (0.2), (0.3) die Integralungleichung 
(1.16) verwenden. 

Die Funktion I(x, y) ist eine Majorante der Funktion /(x, y) (im Sinne der 


Beziehungen (4.5)) mit f(x, 0)=0. Sie kann in y linear oder nichtlinear sein. 


Die geeignete Wahl von /(x, y) wird in Nr. 2.3 und in den Beispielen der §§ 3 
und 4 erldutert. Im Spezialfall des in Nr. 2.4 behandelten vereinfachten Newton- 
schen Verfahrens ist es z.B. besonders zweckmaBig, eine in y nichtlineare Majo- 
rante zu benutzen. 

Bei der Herleitung der Ergebnisse stiitzen wir uns auf Satze tiber das Itera- 
tionsverfahren u,,,— Tu, in abstrakten Raumen, welche in [//] als Spezialfall 
(und vorher in etwas anderer Form in [10]) bewiesen wurden®. Fiir bestimmte 
Aufgaben bei gewohnlichen Differentialgleichungen wurden Fehlerabschatzungen 


ahnlicher Art mit in y linearer Majorante /(x, y) bereits in [8] mitgeteilt. Aller- 
dings wurde dort eine zusitzliche, hier nicht erforderliche Konvergenzbedingung 
benotigt. Die Fehlerschranken, welche der bekannte Fixpunktsatz fiir kontra- 
hierende Abbildungen [12], [3] liefert, sind in den hier fiir lineare Operatoren 
hergeleiteten enthalten (s. Nr. 2.3). 

In §1 werden die Satze aus [//] auf Integralgleichungen angewendet, in § 2 
die dabei gewonnenen Ergebnisse auf Differentialgleichungen iibertragen. Die 
§§3 und 4 behandeln spezielle einfache Klassen von Rand- und Anfangswert- 
aufgaben bei gewohnlichen und partiellen Differentialgleichungen und bringen 
numerische Beispiele. 

Zur Rechnung der numerischen Beispiele wurde die am Institut fiir Ange- 
wandte Mathematik der Universitat Hamburg aufgestellte Rechenanlage IBM 650 
benutzt (s. insbesondere Beispiel 1). Eine Vereinigung von funktional-analytischer 
Theorie und der Verwendung elektronischer Anlagen fiir die praktische Rechnung 
scheint in der numerischen Analysis Erfolg zu versprechen. Zum Beispiel ware 
es im Sinne dieser Arbeit sehr erwiinscht, Verfahren fiir Rechenanlagen zu 
entwickeln, mit denen man in einer mit Parametern versehenen Funktion F 
diese Parameter so bestimmen kann, daB F eine gegebene Funktion von einer 
Seite her méglichst gut annahert, denn auf eine solche Aufgabe fiihrt die Er- 
mittelung einer Schranke o(x) aus (0.2), (0.3). 


§ 1. Integralgleichungen 
1.1. Aufgabenstellung. % sei ein (nicht notwendig offenes) beschranktes Gebiet 
des p-dimensionalen Euklidischen Raumes. Die Punkte dieses Raumes bezeichnen 
wir mit x oder €; y und z bedeuten dagegen reelle Zahlen. Aussagen iiber Funk- 
tionen von x sollen auf dem durch AbschlieBen von 8 entstehenden Gebiet 8 
gelten. Gesucht sind stetige Funktionen u(x), welche einer gegebenen Integral- 


gleichung u(x) = g(x) a G(x, &) f(é,w(€)) dé 


. 3 Die dortigen Ergebnisse enthalten auch Eindeutigkeitsaussagen. Auf solche ver- 
zichten wir hier. 
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gentigen. Die Funktionen g(x), G(x, &) und f(x, y) mégen dabei folgende Eigen- 
schaften besitzen. 

a) g(x) sei stetig. 
b) G(x, &) sei auf 8 xB definiert, abgesehen héchstens von Argumentepaaren 
x,€ mit ~=&. Zu jedem e>0 gebe es eine Zahl 6(e) derart, daB 
SG it) = CG's) diez ov tire x, 4/ eB) mit ly =e’) <6 44) 
8 


gilt, wobei |«—x’| den Euklidischen Abstand der Punkte x und x’ bedeutet. 
Bei beliebigen stetigen Funktionen v(x) und s(x) mit |7(x)|<s(x) gelte 


| { Xr(e) 48] s f|K]|r(@)| 48s f|K| (6) a8, (1.2) 
B B B 


wenn K die Funktion G(x, €) oder eine Differenz G(x, &)—G(x’, &) (x, x’€%) 
ist. Dies bedeutet gleichzeitig, daB die auftretenden Integrale (im Sinne irgend- 
einer der tiblichen Definitionen) existieren sollen. 


Die Funktion heres. é)| erfiille ebenfalls die von G(x, €) geforderten Voraus- 
setzungen. 


c) Die Funktion /(x, y) sei stetig auf einem ((p + 1)-dimensionalen) Gebiet# 
®: «EB, v(x)SySy(x) — (p(x), v(x) auf B definiert). (1.3) 


Zu einer geeignet gewadhlten, in & verlaufenden stetigen Funktion w(x) gebe es 
eine auf einem Gebiet* 


6: xEB, OX<y<d(x) (0(x) auf B definiert) (1.4) 


erklarte stetige Funktion f (x, y) derart, daB 


~ 


i(x, 0) =0, 
O< f(x, y+2)—f (x, 9) Sfx, +2) f(x, ') fir OS ySy',0SzS7, (1.5) 
lily) — He) Si (x ly — 2] +e — # Ml) — Fle z — @)) 


gilt, soweit die jeweils vorkommenden Argumentepaare in & bzw. 6 liegen. 


Die gegebene Integralgleichung soll mit dem Iterationsverfahren 
Un+1(*) = 8 (%) SP G(x,€) f(E,u,(€))4& (w= 0,1, 2,...) (1.6) 


gelést werden. Dabei sei u(x) stetig und verlaufe in ©. 
Die Voraussetzungen lassen sich mildern: g(x) darf unstetig sein. Ist dies 
der Fall, so sind Lésungen der Form 


u(x) = g(x) + u(x) (1.7) 
4 Die beiden Zeichen < in (1.3) und das rechte in (1.4) kénnen z.T. oder ins- 
gesamt durch < ersetzt werden; auch die Falle p=— cw, y= +c, B=-+ c sind 


zugelassen. Man wird sogar im allgemeinen # = + oo setzen. In entsprechender Weise 
sind dann die weiter vorkommenden Ungleichungen abzuandern, welche 9, y oder 3 
enthalten. 
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mit stetigem @(x) gesucht. Die Ungleichungen (1.2) mégen dann auch fiir alle 
Funktionen 7 und s mit |v|<s gelten, welche sich in der Gestalt f(x, ~) mit einer 
Funktion w der Art (1.7) schreiben lassen, und w und wu sollen dann ebenfalls 
die Gestalt (1.7) haben: w=g+W, up=g+Upo. 

1.2. Beispiele fiir Majoranten {(x, y). i (x, y) bezeichnen wir als Majorante 
der Funktion /(x, y) beziiglich w(x). Im allgemeinen wird man w(x) =%(%) 
waihlen. Meistens laBt sich #(x) = co setzen. Im folgenden werden Beispiele fiir 


solche Majoranten angegeben. Wir nennen /(x, y) oder fi (x, y) linear, falls diese 
Funktionen in y linear sind, andernfalls nichtlinear. Eine lineare Funktion 


f(x, y) ist auch homogen in y, bei f(x, y) braucht dies nicht der Fall zu sein. 
Es sei nun /(x, y) eine auf einem Gebiet & (1.3) erklarte stetige Funktion 
und w(x) eine beliebige in & verlaufende stetige Funktion. 
Hilfssatz 1. Die A bleitung 1, (x, y) extstiere und set stetig auf &. Dann erfillt 


eine auf einem Gebiet & (1.4) erklarte stetige Funktion H(x, y) die Voraussetzungen 
(1.5), wenn sie eine stetige, nichtnegative, mit y monoton nichtfallende Ableitung 


i, (x, y) besitzt und 


7 


sowte 


i (x, 0) 
)) 


0 (1.8) 
lt, (x, ¥ + (x) Shy ( 


< f, (x, |y|) 


gilt, soweit die Punkte x, y+ w(x) und x,|y| in & bzw. & liegen. 


Beweis. Es ist (soweit die vorkommenden Funktionswerte erklart sind) 


(x, 9 +2) —H(x,9) =Jh,(%y +42) 2dt= 0 


und 
nb 


fi, (x,y +¢2) ie, (x, y+ #2") 2' dt = f(x, y' +2’) — f(x,y’) 


0 0 
fir AOS oy Oe 


Ferner erhalt man 
May) Heal =| Siete —a) 2) af 
5 flij(s2—w+ ty —2) +0) ly —2| at 
SS, (x)|2—w + ty —2)) ly a] at 


Sf t,(%,|2—w| +t|y 2) |y—2] at 


=i (x,y —2| + |2— wl) —7(2,|2—a)). 
Aus Hilfssatz 1 folgert man leicht den 


Hilfssatz 2. Die Funktion f(x,y) sei fir xCB, — oo<y< oo erklart und 
besitze in dresem Gebiet eine stetige Ableitung Ley Gis 


[f,(*, —y+w (x))| S7,(x, yt+-o(x (x)) S4,(%, 9+ w(x)) fir 0X y<y'<o, (1.9) 
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so ist 


I(x, ¥) =H(x,y + w(x)) — f(x, w(x) (1.10) 
eine fiir x CB, OX y< co defimerte Majorante der Funktion f(x, y) beztiglhich w(x). 
Polynome in y als Majoranten liefert der 


Hilfssatz 3. Die Funktion f(x, y) besttze auf & stetige Ableitungen 


f)'(«,9) = ,y) (i =0,1,2,...,.m—1; m=1), 


und es gebe stetige Funktionen a,;(x), mit denen 
| f° (x, w (x))| S a, (x) ((==41,2,...,92 — 1), 
oe a eee far x€B, y(x)<y,2Sy(a) (1-11) 


gilt. Dann ist die fiir «CB, 0S y< © erklérte Funktion 


m 


f(x, 9) =>)  as(e) 9» 


eine Majorante der Funktion f(x, y) beztiglich w(x). 
Gilt z.B. fiir die Funktion f(x, y) selbst eine Lipschitz-Bedingung 


li(x,9) —#(%2)| Sa (x) |y —2|, (1.12) 
so kann man die lineare Majorante ii (x, vy) =a, (x) y verwenden. 


Beweis des Hilfssatzes 3. Fiir m=1 ist die Richtigkeit der Behauptung 
leicht einzusehen. Im folgenden sei m=22. Man berechnet 


f(x, 9) — fle) = S41 (x, w(x) [ly — 09)" @ — eM) +ER (1.13) 


Raff h ff fox, w ty — 20) — FON x, w)] x 
x(n — w)"-? (y — 2) dtydt,... dt 


m—1 


und 7 =z+4,,_1(y—42), indem man die Integrationen ausfiihrt. Ferner schatzt 
man ab: 


l(y—@) — (¢—»)|'=|[ly —2) + @ —@)]'— (¢—@)'| S (ly—2| + 2 —-)— [2 -w 


sowie mit (1.11): 


il Shkaly th 


IRIS ff fe fem(2) Aly — ol" |y —2| diy... dt 
Oro 6 


a m—1 (y oo 2) ly > Z| Aty—1 
0 
z 


sete f (eel ttle — ally 2] dbs 
0 


= S0lF) [ily —2| + le — wl)"—|z— |"). 
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Insgesamt ergibt sich dann aus (1.13) 
f(xy) — faa) SA asx) [ly — 2 + 2 — el) [2 — #1]. 
i=1 
1.3. Funktionalanalytische Formulierung der Aufgabe. KR bedeute die Menge 
der Funktionen u(x), welche sich in der Form 


u(x) = g(x) +%(x) 


mit auf B stetiger Funktion “#(x) schreiben lassen. (Bei stetigem g(x) ist }t also 
die Menge der auf 8 stetigen Funktionen. Nur unter den in Nr. 1.4 genannten 
schwicheren Voraussetzungen tiber g(x) muB man w(x) in dieser Form darstellen.) 
Fiir jedes Elementepaar wu, v€ }t wird durch 


d (u,v) =|u(x)—v(x)| (= |# (x) —2(x)]) 


ein Abstand 6(w, v) definiert. Diesen fassen wir als Element des Raumes % der 
auf 8 stetigen Funktionen auf. Fiir die Elemente 9, o, ... des Raumes Xt werden 
die Addition @ +o, die skalare Multiplikation «@ und die Beziehung @So in der 
bei Funktionen tiblichen Weise erklart. Sind 9 und 0, (m =1, 2,...) Elemente 
EN, so bedeute o =lim o,,, daB die Funktionenfolge @,,(x) fiir 7— oo gleichmaBig 
auf B gegen o(x) konvergiert. 

In R wird ein Grenzwertbegriff fiir Folgen wu, (wn =1, 2, ...) dadurch definiert, 
daB u lim u, der Beziehung lim 6 (wu, w,) =O aquivalent sein soll. Dieser Limes- 
Begriff bedeutet ebenfalls die auf 8 gleichmaBige Konvergenz. ist vollstandig 
in dem Sinne, daB jede Folge u,(x), fiir die lim d(u,,, u,) =O bei beliebiger 
monotoner Folge k, gilt, ein Grenzelement u€% besitzt (lim 6(u, uU,) =0), 

Es sei nun D die Menge der Funktionen w€ WR, fiir welche die Punkte x, u(x) 
in ® liegen, und 7 der auf D durch 


Tu = g(x) a G(x, £) f(€, (8) dé 


definierte Operator. Wegen der von G(x, &) verlangten Eigenschaften ist das 
Integral eine stetige Funktion von x, T bildet also D in R ab. Die Gleichung 
u—=Tu ist dem Ausgangsproblem aquivalent, und (41.6) laBt sich in der Form 


Uy p= Lh, n= O42 2 ae (1.14) 
schreiben. 


1.4. Abschiitzen des Operators T. Wir wollen einen in [11] bewiesenen Satz 
benutzen, um Aussagen iiber das Iterationsverfahren (1.14) zu erhalten. Zu 
diesem Zwecke schatzen wir den Abstand 6(Tu, Tv) ab. Wegen (1.5) ist 


6(Tu, Tv) =|Tu— To| 
= ae E) [/(6, 4) —f(6,v)] dé =) | G(x, &)| |A(E, «) —F(G, »)| dé 


SJ \G(x, 8)] [FE 14 — 0] + |v — wl) — FG, |v — w))] ae. 
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Das ist eine Ungleichung der in [11] als Spezialfall behandelten Form 
6(Tu, Tv) SF [6 (u,v), d(u, w), d(v, w)] 


mit 


Fit, e°, o°] = P(e + @*) — Pe 
und dem auf der Menge ® der in & verlaufenden stetigen Funktionen o(x) durch 
Po =S| G(x, 8] 1(E o(6) a8 
definierten Operator. 


1.5. Untersuchen des Vergleichsverfahrens. Konvergenzaussagen und Fehler- 
abschatzungen fiir das Verfahren (1.14) erhalten wir nach [1/1], §1 mit Hilfe 
eines Vergleichsverfahrens 


Cee hoe GE Oriae (4.15) 
T bedeutet dabei den durch 
Ing Of Oo E O'— P Oy 

aus D definierten Operator; 0, und o, sind stetige Funktionen mit 

| %o(%) — @(x)|Seo(x), | (*) — mo(*)| Sai(*), 00 (*) + a1 (*) S P(x) 5. 

Wir untersuchen die Konvergenz dieses Vergleichsverfahrens. Gibt es eine 
stetige Funktion t(x) mit @9(%)<t(x)<@(x), welche der Ungleichung 

Cas Tr 


geniigt, so ist die Iteration (1.15) nach Zusatz 1 in [7] unbeschrankt durchfiihr- 
bar, die Folge @,, nimmt monoton nicht ab und ist durch t nach oben beschrankt. 
Die Folge o0,,(«) konvergiert also punktweise gegen eine auf B erklarte Funktion 
o*(x). eh 

Wir beweisen, daB die Konvergenz auf 8 gleichmaBig ist. Die Funktionen 
0, (x) (w=1, 2, ...) sind gleichmaBig beschrankt und gleichgradig stetig, denn es 
gilt 

0S @,(*) St (x) S Max t(*) 
B 


und infolge (1.1) mit der stetigen Funktion v(x) =o,+ @)— Po bei gegebenem 
e>0 


| 0, (x) — 0,(x’)| S|xz(*) =a (X) sg te ©) =e (58) | F(E, On —1) aé 
< |x) — x10) 4/169) — 6 j|ig.2) de 
<|x(x) — x(x’)| #1 16(218) — Ge &)| dé. Max f (2, Aca ee 


fiir geniigend kleine Werte |x—x’|. Die Folge @,(x) enthalt also eine gleich- 
maBig konvergente Teilfolge und konvergiert damit selbst gleichmaBig, da sie 
monoton ist. o*(x) ist also stetig und es gilt e*=limg, im Sinne unserer 
Definition. 


5 Die Méglichkeit einer solchen Abschatzung mit @)+ 0, S@ wird vorausgesetzt. 
Siehe FuSBnote 6. 
Arch. Rational Mech. Anal., Vol. 2 25 
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o* erfiillt die Gleichung 9* = T 0*, denn wegen der gleichmaBigen Konvergenz 


~ 


der Folge o,,(x) gegen o*(x) und der Stetigkeit der Funktion f(x, y) ist bei gege- 
benem ¢>0 


pf 7 6" | =o" On| |T o* Fo, -1| 
=|o*—enlt] [E(x 6) ((E 0%) — HE Qna)] aE 
® ~ a 
= |o*(*) —en(*)| + Max S\G(x, é)| dg: ces f(x, o*(x)) f(x, On—1())| <é 
g 8B B) 


fiir geniigend groBes m. Das hierbei auftretende Maximum der Funktion 
S| G(x, &)| d& existiert, da diese Funktion infolge der von G(x,&) verlangten 
B 


Eigenschaften stetig ist. 
Praktisch wird man im allgemeinen nicht t ermitteln, sondern o = t —o,— @p. 
Diese Funktion mu8 dann der Ungleichung 


Oe TO Oi Og) ae al 
genugen. 
1.6. Ergebnis. Mit Satz 1 aus [//] erhalt man unter den in Nr. 1.1 genannten 
Voraussetzungen das folgende Ergebnis fiir das Iterationsverfahren (1.6) zur 
Lésung der gegebenen Aufgabe. 


Satz 1. 09(x) und o,(x) seten stetige Funktionen, welche die Bedingungen 
| % (x) — w(x)|Seo(x), |m (x) — Ho()|Soy(*), — @o(*) +.0(x) S B(x) § 
erfiillen. Gibt es eine stetige Funktion o(x) mit 
OSo(x) und o(x) = B(x) — o9(x) —0,(x) §, 
welche der Ungleichung 
a(x) = | |G (x, &)| [Eo +0118) + a0() — FE, oo ())] dé (1.16) 


gentigt und gilt (x) +o(x) <m(x) <p(x) —o(n), (487) 


so ist durch die Iterationsvorschrift (1.6) eine Folge von Funktionen u, (x) (n=O, 
1, 2, ...) defineert, welche gleichmdpig auf 8 gegen eine Lésung u*(x) der gegebenen 
Aufgabe konvergiert, und es gilt die Fehlerabschatzung 


| u*(x) — u,(x)| <o(x), (1.18) 
sowre allgemeiner 
| w*(x) — u,(%)|St(x) —@,(x) (m =0,4,2,...) 
mit T=G +0,+ 0) und den durch (1.15) definierten Funktionen Oye). 
Die Voraussetzung (1.17) hat zur Folge, daB D alle Funktionen wE€R mit 
|1(x) — a4 (x)| So(x) (1.19) 


enthalt. Nach Fu note 5 in [//] braucht D aber sogar nur die (1.19) geniigenden 
Funktionen wc TD und deren Haufungselemente zu umfassen. Deshalb gilt der 


® Im Normalfall = oo ist diese Ungleichung immer erfiillt. 


Fehlerabschatzungen bei Differentialgleichungen 375 


Zusatz 1.1. Die Forderung (1.17) léBt sich durch 
Max [®(x), a(x) —o(x)]=o(x), Min [P(x), (x) +o(2)j)<yla) (1.20) 
ersetzen, wenn D(x) und W(x) irgendwelche Funktionen bedeuten, nuit denen 
D(x) S g(x) +f G(x, £) f(é, y) dé S P(x) (1.21) 
fiir 
gilt. 
(1.20) ist z.B. immer erfiillt, falls p(x) < @O(x) SV (x) Sy(x) ist. 


Mit dem folgenden Zusatz 1.2 kann man einfache Schranken o(x) gewinnen, 
wenn man (1.16) nicht direkt lésen will. In diesem Zusatz benutzen wir weitere 


Voraussetzungen: Die Funktion f(x, y) sei fiir x€B, ox y<oo erklart. Es 
gebe stetige, nichtnegative Funktionen V(x) und W(x), eine fiir «20 definierte 
stetige Funktion (a) sowie Konstanten a, «, und y>0O mit folgenden Eigen- 
schaften: 


p(0) =0, 

f(x, (x +B) W(x)) — f(x, «W(x)) [ple +B) —p(a)]V(x) fir «B20, (1.22) 
| Ho (x) — w(x)| Sa%q W(x), — | m1(x) — Mo (x)| pa 

JG &|V(é) dé Sy W(x). (1.23) 


Zusatz 1.2 Gibi es unter den obigen Voraussetzungen eine Zahl n=O mit 
n2Zv len + O% + %q) — Pp (a) |, 


so gentigt die Funktion fe 
a(x) = f G(x, 8) VE) as (1.24) 
B 


(und ebenso a(x) = W(x)) (1.25) 
der Ungleichung (1.10) bet Q9=%)W (x) und o,=%,W(x). 
Die Richtigkeit dieser Behauptung folgt mit o<7W aus 
SIG| [FE + 01+ 00) — FE, o0)] 4 SS |G| [HE (7 + 24 + a0) W) — f(E %W)] dé 
<[p(y +o + %) —d(m)] [GV aes 7 [|G|Vaésnw. 


Bei linearer Majorante f(x, y) =a,(x)y benutzt man im Zusatz 1.2 zweck- 
maBigerweise p(x) =a. Die Ungleichung (1.22) erhalt dann die einfache Form 


a, (x) W(x) S V(x), 


sy , so daB dann die Schran- 


und unter der Voraussetzung y<1 ergibt sich 7 = 
ken (1.24) und (1.25) in 


Cla 


(baw. o(x) = oS W(x) (1.27) 
iibergehen. 
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§ 2. Differentialgleichungen 
2.1. Aufgabenstellung. Wie in Nr. 1.1 sei 8 ein (nicht notwendig offenes) 
beschranktes Gebiet des p-dimensionalen Raumes. Sein Rand werde mitey: 
bezeichnet. Aussagen iiber Funktionen von x sollen auch hier auf B= S+L 
gelten, sofern nichts anderes gesagt wird. Gesucht ist die Lésung u*(x) einer 


gegebenen Differentialgleichung 
M{u] =f(x,u), (2.4) 


welche auf J" (oder Teilen von J") gegebene Randbedingungen 
U,[u] =; (C6 ee ean (2.2) 


erfiillt. M und die U, seien (formale) lineare, homogene Differentialoperatoren. 
Unter ,,Randbedingungen“ verstehen wir dabei im erweiterten Sinne irgend- 
welche ,,Bedingungen auf dem Rande“. Damit werden dann unter anderem auch 
Anfangswertaufgaben erfaBt (s. § 4). 


Als Lésung einer Aufgabe 


M[uj =6 (oder auch M[u]= 6), (2.3) 

tale (eres (2.4) 
oder 

Title. een (2.5) 


bei gegebener Funktion b(x) oder b(x%, u) (oder einer anderen M{u] und die U, || 
enthaltenden Aufgabe) bezeichnen wir eine stetige Funktion mit bestimmten von 
der speziellen Aufgabenstellung abhangenden Differenzierbarkeitseigenschaften, 
welche auf 8 der Differentialgleichung (bzw. -ungleichung) (2.3) geniigt und auf 
I’ die Randbedingungen (2.4) bzw. (2.5) erfiillt. 

Die Aufgabe 

Miu|=0, Uful =y,; tee Toe ae) 

besitze eine Lésung g(x). Ferner existiere eine ,,Greensche Funktion‘ G (x6) 
mit folgenden Eigenschaften: 

a) G(x, &) erfiille die in Nr. 1.1 genannten Voraussetzungen b). 

b) Ist die Aufgabe 


M[ul=r(x), Uf[u]=y; (C412) 
fiir eine gegebene stetige Funktion v(x) lésbar, so habe die Lésung die Gestalt 
u(x) = g(x) uA, dé. 


c) Das Ausgangsproblem (2.1), (2.2) sei der Aufgabe dquivalent, eine stetige 
Losung der Integralgleichung 
u(x) = g(x) aed é) f (&, u(&) dé (2.6) 
zu ermitteln. 


d) Die Funktion /(«, y) habe die in Nr. 1.4 unter c) genannten Eigenschaften 
(beztiglich der geeigneten Wahl des Gebietes @ s. Nr. 2:3) 
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Zur Lésung der Aufgabe (2.1), (2.2) wird das Iterationsverfahren 
M[u,41] =f(%, u,), U; [Uni] = Yi (7 =1,2,...; nm =0,1,2,...) (2.7) 


angesetzt. Soweit die Funktionen w,,,,(*) hierdurch bei stetigem w(x) erklart 
sind, gentigen sie den Gleichungen (1.6) mit den hier definierten Funktionen g(x) 
und G(x, é). 

Die Voraussetzungen iiber g(x) koénnen entsprechend wie in Nr. 1.1 gemildert 
werden: g(x) braucht nicht stetig zu sein. Statt der Stetigkeit wird dann von 
der Lésung u(x) einer Aufgabe der Form (2.3), (2.4) mit den inhomogenen Rand- 
bedingungen nur gefordert, daB w(x) sich in der Gestalt (1.7) mit stetigem 7 (x) 
schreiben ]a8t, und die vorausgesetzten Differenzierbarkeitseigenschaften werden 
ebenfalls von u(x) verlangt. Auf diese Weise kann man weitere Randbedingungen 
erfassen. Sind z.B. auf I” stiickweise stetige Funktionswerte vorgeschrieben, so 
ist g(x) auf B nicht stetig. Jedoch wollen wir diesen Fall der unstetigen Funk- 
tion g(x) im folgenden nicht weiter beriicksichtigen. 

2.2. Ergebnis. Unter den in 2.1 genannten Voraussetzungen erhalt man mit 
§1 fiir die gegebene Aufgabe (2.1), (2.2) den 

Satz 2. a) Ist fiir eine 1m Gebtet & verlaufende stetige Ausgangsnaherung ug (x) 
durch (2.7) eine erste Naherung u,(x) definiert, so gilt mit diesen Funktionen 
Uo (x), U(x) wortlich der Satz 1’. 

b) Im Falle 


G(x,6)20 fir xE  (x,€€R) (2.8) 

kann man statt (1.16) fordern, daB o(x) eine Lésung der Bezxvehungen 
M{o] =f (x,0(x) +.01(x) + e0(x)) — F(x, @0(4)), (2.9) 
Ulo]=0 (¢=1,2,...) (2.10) 


mut stetigem M{o} set. 
Beweis. a) ist unmittelbar klar. Aus (2.9), (2.10) folgt mit (2.8) 


o=[GM[o] dé 2 fG [f(E,.0 +01+ eo) —f(E, @)] 4é, 


so daB auch b) richtig ist. 
Setzt man w(x) =u)(x) und @)(x) =0, so geht (1.16) in 


o (x) So é)| t(£,0(€ +0,() dé 


und (2.9) entsprechend in 
M[o}= f(x, 0(x) +(x) 


iiber. Bei linearer Majorante F(x, y) erhalt man immer diese einfacheren Un- 
gleichungen. 

Im allgemeinen ist es bei nichtnegativem G(x, &) bequemer, o(x) als Losung 
von (2.9), (2.10) zu bestimmen. Allerdings hat die Integralungleichung (1.16) 
mehr und darunter meistens auch einfachere Lésungen, denn es wird dabei nicht 


7 e(x) und G(z, &) bedeuten darin die in Nr. 2.1 erklarten Funktionen. 
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gefordert, daB o(x) den Randbedingungen (2.10) geniigt. Diese Randbedin- 
gungen verhindern es z.B. im allgemeinen, daB eine Konstante als Lésung von 
(2.9), (2.10) zu verwenden ist. Siehe jedoch Zusatz 2.3. 


Zusatze 2.1 und 2.2. Die Zusdtze 1.1 und 1.2 sind hier unmittelbar zu ver- 
wenden. Im Falle (2.8) einer nichtnegativen Funktion G(x, &) kann man statt (1.23) 
auch die stdrkere Forderung stellen, dak W(x) eine Losung der Bezehungen 


M[w]= ; Vix), U{[W]=o (=1,2,...) 
mit stetigem M[W] sei. Hat ferner die Aufgabe 
M[o] =~ Vi@\.) Uigh= 6 Gc A oie) 


im Falle (2.8) eine Lésung a(x), so ist dies die im (1.24) stehende Schranke. 


Zusatz 2.3. Die Bedingungen (2.10) lassen sich durch die Forderung ersetzen, 
dap die Aufgabe ; 
Mii) == OF > Onl leer Oslo (6.22452 pace) (2.11) 


eine nichinegative Losung besitze. 
Diese Méglichkeit wird in den Nrn. 3.2 und 4.2 ausgenutzt. 


Beweis. o(x) erfiille die Ungleichung (2.9), M[o] sei stetig, und die Aufgabe 
(2.11) habe eine nichtnegative Lésung z(x). fe die Funktion 6 (x) =o (x) —z(x) 
gilt dann 

Mio) == Mio nla) Ola Pian eee 
mit 
r(x) =f (x, 0 +0, + Qo) — (% Qo) - 
Es ist daher 
a(x) ee) M\o\ dé RES é)7(é)d&é=0, 


so daB man wegen 6(”)<o(x) weiter abschatzen kann: 
r(x) 2 f(x, 6 +0, + Qo) — f(x, @0)- 


6(x) gentigt also den in Satz 2 fiir o(«) geforderten Bedingungen (2.9), (2.10), 
und statt 6() ist auch o(x) als Fehlerschranke zu verwenden. 


Bei manchen Aufgaben ist es auch nicht erforderlich, daB w,(x) die Rand- 
bedingungen genau erfiillt (s. z.B. Nr. 3.2). 


2.3. Vergleich mit bekannten Abschatzungen. Evlduterung der Ergebnisse. Wir 
vergleichen die bewiesene Fehlerabschatzung (1.18) mit denjenigen, welche man 
mit Hilfe des bekannten Fixpunktsatzes fiir kontrahierende Operatoren [12], [3] 
bekommen kann. Zunachst nehmen wir an, es werde eine lineare Majorante 
verwendet. Dann liefert der Zusatz 1.2 die Schranken (4.26) und (4. 27) Die 
grébere dieser Schranken (1.27) kann man auch mit dem erwahnten Fixpunktsatz 
erhalten, indem man den numerischen Abstand 


6(u, v) = Max | sel (2.12) 
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benutzt. Wahlt man dabei wie hier als die Menge der stetigen Funktionen, 
so muB noch W(x)>0 gefordert werden. (Es ist aber méglich, in (2.12) auch 
Funktionen W(x) zuzulassen, die Nullstellen haben, indem man andere Funk- 
tionenmengen %t betrachtet. Derartige Schwierigkeiten treten hier jedoch nicht 
auf.) Die Schranke (1.26) stellt gegentiber (1.27) schon eine Verbesserung dar; 
z.B. hangt sie auch im Falle der Gewichtsfunktion W(x) =1 von x ab, wahrend 
(1.27) eine Konstante liefert. Man erhalt aber noch genauere Fehlerabschatzungen, 
wenn man o(%) direkt aus (1.16) bzw. (2.9), (2.10) berechnet. 

Ist f(x, y) linear, so wahlt man auch eine lineare Funktion f(x, y). Wir 
untersuchen nun, welche Vorteile entstehen kénnen, wenn man bei nichtlinearem 


/(x, y) eine nichtlineare Majorante f(x, y) verwendet. Dazu besprechen wir zu- 
nachst die Wahl des Gebietes © (1.3). Als & kann man nicht immer dasjenige 
Gebiet ansetzen, in dem sich f(x, y) tiberhaupt in verniinftiger Weise erklaren 
laBt. Hangt die Majorante von & ab [s. z.B. (1.11)], so ist es vielmehr zweck- 
maBig, ein ,,médglichst kleines‘’ Gebiet & zu benutzen’. Jedoch muB (1.17) 
[bzw. (1.20)] gelten. Praktisch geht man in diesem Falle so vor, daB man zu- 
nachst u9(*) und w(x) ermittelt und auf Grund des Verlaufes dieser Funktionen 
(5 so schatzt, daB es die Lésung u*(x) vermutlich enthalten wird. Genauer: 
man schatzt eine obere Schranke S(x) fiir die sich mit Satz 1 oder Satz 2 erge- 
bende Fehlerschranke o (x) =| u*(x) —,(x)| und richtet &, d.h. w(x) und p(x), 
so ein, daB p(x) + S(x) Su (x) Sy(x) — S(x) gilt, daB also & auch den Streifen: 
xEB, uy (x) —S(x)<y<u,(x) +S(x) enthalt. (Auf die méglichen Variationen 
bei Benutzung des Zusatzes 1.1 gehen wir der Einfachheit halber nicht ein.) 
Hat man dann eine Fehlerschranke o (x) berechnet, sieht man oft, daB ein kleineres 
Gebiet & und eine zugehorige giinstigere Majorante die Voraussetzungen auch 
erfiillt hatten, und man kann, wenn man moglichst gute Fehlerschranken haben 
will, die Abschatzung mit der neuen Majorante wiederholen. 


Verwendet man eine lineare Majorante f(x, y), so muB man bei nichtlinearer 
Funktion f(x, y) fast immer so vorgehen, denn eine solche Funktion / geniigt 
im allgemeinen nicht im ganzen Zylinder 


8: xEB, —o<y<o 


einer Lipschitz-Bedingung (1.12) mit festem a,(x). Dies etwas umstandliche 


Verfahren entfallt, wenn man eine nichtlineare Funktion /(x, y) wahlt, mit der 
(1.5) in ganz 8 gilt, was sehr oft méglich ist. Dann kann man & = setzen und 
braucht (1.17) nicht nachzupriifen. In diesem Sinne ist also die Rechnung bei 
nichtlinearer Funktion f(x, y) einfacher, falls /(*, y) nichtlinear ist (vgl. auch 
Beispiel 2). 

Allerdings sind die zu lésenden Ungleichungen (1.16) bzw. (2.9) dann in o(x) 
nichtlinear. Man kann deren Lésungen nicht iiberlagern. Bei den in den §§ 3 
und 4 gerechneten Beispielen mit nichtlinearer Majorante gelang es ohne 
Schwierigkeiten, brauchbare Schranken zu ermitteln. In anderen Fallen kann 
die Nichtlinearitat u.U. stéren. Dann wird folgendes Verfahren vorgeschlagen. 


8 Dagegen wahlt man @(7) in (1.4) immer so groB wie méglich, im allgemeinen kann 
man #= oo setzen. 
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Man schatzt eine Schranke S(x) fiir die Lésung o(x) der Aufgabe (1.16) bzw. 
(2.9), (2.10) — das entspricht also dem oben beschriebenen Vorgang — und ersetzt 
o(x) auf der rechten Seite von (1.16) bzw. (2.9) teilweise durch S(x), so daB eine 
in o(x) lineare (inhomogene) Funktion bleibt, z.B. (¢ +0)? S (o +,) (S +0)”. 
Besitzt die so linearisierte Ungleichung eine Lésung a(x) <S(x), so lést diese 
Funktion auch die urspriingliche Ungleichung, und man kann das Verfahren 
evtl. mit der berechneten Funktion o (x) als neuer Schranke S (x) wiederholen. Bei 
diesem Verfahren braucht man nicht an allen Stellen die gleiche Schranke S (x) 
zu verwenden und man nutzt auf diese Weise unter anderem den Vorteil nicht- 


linearer Funktionen f(x, y) aus, oft bessere Fehlerabschatzungen zu liefern. 
Wir erlautern, wann mit nichtlinearen Majoranten genauere Fehlerab- 
schatzungen zu erwarten sind. Der Einfachheit halber betrachten wir nur Funk- 


tionen I(x, y), welche mit Hilfssatz 1 ermittelt werden kénnen. Ist } (x, v) linear, 


so hangt /,(x,|y|) in (1.8) nicht von y ab, eine nichtlineare Funktion f(x, ¥) 
dagegen kann das Anwachsen der Funktion | f,(x, y+w)| mit y oder —y bertick- 
sichtigen. Je starker |/,(x, y+-w)| mit y oder —y zunimmt, desto giinstiger ist 
es im allgemeinen, eine nichtlineare Majorante zu verwenden. — Erfiillt f(x, y) 
die Voraussetzungen des Hilfssatzes 2, ist G(x, &)20 und wXu,ySu, und ver- 
wendet man o,=%— %, 09=%)— w und die Majorante (1.10), so ist o=u*—u,° 
Lésung der Aufgabe (2.9), (2.10) mit dem Gleichheitszeichen in (2.9). In diesem 
Falle sind also die hier abgeleiteten Fehlerabschatzungen fiir | w*— u,| nicht weiter 
zu verscharfen, und durch geniigend sorgfaltige Berechnung einer Schranke a(x) 
aus (2.9), (2.10) kann man — jedenfalls theoretisch — beliebig nahe an | u*— 2,| 
herankommen. Ein solches Resultat ist bei nichtlinearem f(x, y) offensichtlich 


nur moéglich, wenn man auch nichtlineare Majoranten f(x, y) betrachtet. — 
Natiirlich hangt es stark von den speziellen Eigenschaften des gerade vorliegenden 
Problems ab, welche Majorante zu wahlen ist. 


2.4. Das vereinfachte Newtonsche Verfahren. Es sei nun eine Differential- 
gleichungsaufgabe 


Lluj =h(x,u), U,[u] =y,; (tem 42s nes) (2.13) 
mit linearem, homogenem Differentialoperator L gegeben. Man kann versuchen, 
sie mit dem Iterationsverfahren 

Elta] =h(x,u,),  U; [Mya] =; (t= 4, 2.5.3 0 = On ly esd) 


zu lésen und unsere Ergebnisse fir M—L und f=h anzuwenden. Oft ist es 
jedoch zweckmaBig, die gegebene Aufgabe umzuformen, M und / also anders 
zu definieren. Will man dann die obigen Ergebnisse benutzen, mu8 man nach- 
priifen, ob die geforderten Voraussetzungen mit den verwendeten GréBen M und f 
erfiillt sind. 

Das ,,an der Stelle w(x) angesetzte vereinfachte Newtonsche Verfahren“ 
(s. [2] bei w =Up) fiir die Aufgabe (2.13) hat die Gestalt (2.7) mit 


M[u] =L[u] —h,(x,w(x))u, f(x,y) =h(x, y) — h,(x,w(x))y. (2.15) 


* Aus den in [4] genannten Ergebnissen folgt hier u,(*) Su*(x). 
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Es ist im allgemeinen schwieriger durchzufiihren als das Verfahren (2.14), liefert 
aber vielfach bessere Werte und fiihrt bei geeigneter Funktion w(x) oft auch 
dann zum Ziel, wenn (2.14) nicht konvergiert. 

Das vereinfachte Newtonsche Verfahren la8t sich als Verfahren der Form 
(2.7) mit /, (x, w(x)) =0 charakterisieren. Wegen dieses Verschwindens der ersten 
Ableitung ist es besonders zweckmaBig, eine nichtlineare Majorante zu ver- 
wenden. Sind A(x, y) und h,(x, y) auf & (1.3) stetig und gilt 


(lf, (#9) — fy (%,2)| =) — [hy (%, 9) — hy (,2)| S ae (x) |v —2| 


te (2.16) 
fir— “ES, (4) Sy, 2=y(*) 
mit einer stetigen Funktion a,(x), so erfiillt nach Hilfssatz 3 z.B. 
f(x, ¥) = 3 42 (x) »° (2.17) 


die Voraussetzungen (1.5). 


Im allgemeinen benutzt man w=w«,. Durch geeignete Wahl einer anderen, 
u, benachbarten Funktion w(x) kann man jedoch manchmal erreichen, daB der 
Ausdruck h, (x, w(x)) und damit die Iterationsvorschrift einfacher wird. 


Eine weitere Moglichkeit, eine einfachere Iterationsvorschrift zu bekommen, 
die Vorteile des Newtonschen Verfahrens jedoch im wesentlichen beizubehalten, 
besteht darin, h, (x, w (x) durch eine stetige Naherungsfunktion q(x) zu ersetzen, 
u.U. eine Konstante. Das entsprechende Iterationsverfahren ist dann (2.7) mit 


M[u] =L{uj—q(x)u, f(x,y) =A(x,y) — g(x) ¥. 
Statt (2.17) erhalt man unter der Voraussetzung (2.16) die Majorante 


~ 


f(x, ¥) =a (%) ¥ + 3 42(4) 9°, 
wobei a, (x) eine stetige Funktion mit 
| 2, (%, @ (x)) — 9(x)| S a (x) 


ist. Das hier auftretende lineare Glied hat um so weniger Gewicht, je besser 
die Naherung q(x) ist. 


Beim vereinfachten Newtonschen Verfahren wird man oft u(x) riickwarts 
von u,(x) ausgehend berechnen. Besonders zweckmaBig erscheint dies z.B. im 
Spezialfall 


h(x,9) = Po(x) y? + bi (x) ¥ + Bo(*) (2.18) 
mit stetigen Funktionen #;(x) und .(*)=-0. Hier wird fir w=w, 
M[u] = L[w] — (2b2(x) u(x) + A1(2)) , 
f(x, ¥) = bo (x) ¥? — 22%) M(x) ¥ + bo(*)- 
Mit (2.15) und (2.18) geht die Differentialgleichung fiir (x) tiber in 
L[uy] — (2p2% + 1) 1 = — be ug + Po- 


Eine Lésung u, bei gegebenem 2, ist 


tg (x) = om (7) — san pax) «2D 
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wobei ¢(x) den Defekt der Differentialgleichung in (2:43) 


(x) = —L[m] + pout + pit + Po 
bedeutet. a 
Als Majorante (beziiglich w=») laBt sich nach (2.16), (2.17) die fiir xEB, 
O<y< o erklarte Funktion 


~ 


f(x = |Pal x)| y? 


verwenden. Mit o,(x) =sgn p2(%) : i 1 (x) —Mo(*)) und g(x) =0 lauten die Be- 
ziehungen (2.9), (2.10) 


L{o] — (2p2% +2:)6 —|P2|o?Zle(*)|, Ulo] =0 eas Hee e (2.20) 


Im Spezialfall (2.18) bestimme man u,(x) also so, daB diese Funktion die 
Randbedingungen U,[u,| =y, erfiillt und ¢(x) - sgn p,(*) nichtnegativ und még- 
lichst klein ausfallt. Als Fehlerabschatzung erhalt man dann | w*(x) — 14 (x)|<o(x) 
mit einer (2.20) gentigenden Funktion o(x), wenn fiir M und f in (2.19) die all- 
gemein geforderten Voraussetzungen gelten und die zugehérige Funktion G (x, &) 
nichtnegativ ist. Im Falle £,(x)>0 erfiillt o (x) =u*(x) — u,(x) die Bedingungen 
(2.20) mit dem Gleichheitszeichen in der Differentialungleichung. 


§ 3. Beispiele flr Randwertaufgaben 

3.1, Beispiel einer gewohnlichen Differentialgleichung. Man kann leicht gewisse 
Klassen von Randwertaufgaben bei gewohnlichen Differentialgleichungen an- 
geben, ftir welche die in Nr. 2.1 genannten Voraussetzungen erfiillt sind, ins- 
besondere also eine Greensche Funktion G(x, &) mit den geforderten Eigen- 
schaften existiert (zur Greenschen Funktion s. etwa [J], [6]; (1.1) ist z.B. erfiillt, 
falls G(x, &) auf Bx B stetig ist). Wir beschranken uns hier darauf, ein numeri- 
sches Beispiel zu rechnen. 

Beispiel 1: 

(M[w| =) — 4" =32%0— (=f (x, u)), w(O) =...) w(1) =i 


Das iibliche Mehrstellen-Differenzenverfahren (s. [3] oder [9]) 
M2 Allee + [1 10 1], f(x, #) =0 


zur Schrittweite h= g ergab die in Tabelle1, Spalte 2 stehenden Naherungs- 
werte # (%;) (x, =7h). Das bei diesem Verfahren auftretende nichtlineare Gleichungs- 
system wurde iterativ nach einer in [9] beschriebenen Methode gelést. Die dabei 
erforderliche Rechenarbeit war sehr gering, da die benutzte Methode fiir die 
Rechenanlage IBM 650 programmiert ist. Auch fiir die weiteren Rechnungen 
wurde die Rechenanlage herangezogen. 

Mit Hilfe der Ausgleichsrechnung wurden dann Konstanten a, b so bestimmt, 
daB die Werte u,(x,;) der Funktion 


U(x) =o (1 +%) + 2x(1— x) (a+ bx) 


die #(x;) im Sinne der Fehlerquadratmethode méglichst gut annahern (s. Ta- 


ll Spalt : 
belle 1, Spalte 3) a =0,007858, b=0,372027. 
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Von dieser Funktion u,(«) ausgehend erhalt man nach der Iterationsvorschrift 
Uji =3%XU,— UE, yi 3(0) = 4, - 4H 4,(1) =1 (= O42.) 
die erste Naherung 9 
uy (*) = % (A +%) + 2(1— x) Da, x 
(Labelle 1, Spalte 4 und Tabelle 2). Es ist s 
| 1 (x) — u(x)| So,(%) mit o,(x) =x(1—-x) a, % ==.0,0367.. 


Tabelle 1 (Beispiel 1) 


a 4 (x4) Uo(%;) Uy(%4) o(%;) 

0 OS 0,5 0,5 0,0 

1 0,580145 0,578 290 0,580 166 0,001 590 
2 0,661 302 0,660 787 0,661 328 0,002 923 
3 0,741 769 0,743 133 0,741 798 | 0,003 804 
4 0,818 845 | 0,820 968 0,818874 0,004 111 
5 0,888 686 0,889 932 0,888 699 0,003 804 
6 0,946 269 0,945 665 0,946251 0,002 923 
i 0,985 564 | 0,983 807 0,985 529 0,001 590 
8 1,0 cake 1,0 [oko 


Zur Fehlerabschatzung ermitteln wir eine einfache dapels 2g (Bereich) 


lineare Majorante. Im Gebiet j Oty 
OF O79) Paws, @ 0,134487 
1 0,196 987 
welches u(x) und uw, (x) enthalt, gilt 2 0,146 720 
3 0,059070 
ane fe 4 — 0,014 350 
lf, 9)| =3 || [2% —y| 83, 5 — 0,018 614 
Mbiks 6. ; 6 — 0,005 782 
so da wir hierfiir nach Hilfssatz 3 die Funktion 7 — 0,002688 
bs 8 — 0,000 797 
f(x,y) =3y 9 0,000468 


benutzen kénnen. Alle erforderlichen Voraussetzungen sind erfiillt, insbesondere 
gilt auch (2.8). Man bekommt fiir o(x) die linearen Bedingungen 


—o" 230 +30, G.1) 
o(0) =o(4) =O. (3.2) 
Die Funktion 
=, 2, a : ey : la oa) ores as F ae as 
a(t) => sag (sin V3 # + sin J (1 — 2) — sin Y3} — ax (1 — x) 


(Tabelle 1, Spalte 5) 
erfiillt diese Forderungen, und zwar mit dem Gleichheitszeichen in (3.1). Es gilt 
oe ¢$o(x) <m(x) <1 —o(x). (3.3) 
Nach Satz 2 existiert also eine Lésung u*(x) mit 


| w*(x) —m(x)|So(x) (<0,00412). 
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Ein Ansatz 
a(x) =Ax(1—x)[1+%(1—+%)] (A = const) 


zur Lésung der Aufgabe (3.1), (3.2), der so gewahlt ist, daB o’’ den Faktor x (41 — oa 
enthalt, fithrt auf A =;4,«. Auch fiir diese Schranke o(x) gilt (3.3), und es ist 
hierfiir o (x) =0,00417. 

Eine nichtlineare Majorante zu verwenden, hat bei dieser Aufgabe nicht viel 


Sinn, weil | /, («, y)| das in der obigen linearen Funktion /(x, y) benutzte Maximum 
3 = Max|f, (x, y)| fir x=1 und den Wert y=w*(1) =1 der Losungsfunktion 


~ 


annimmt. Eine optimale nichtlineare Majorante /(x, y) ist auch von kompli- 
zierterer Gestalt, da f,(x, y) in jedem die Lésung enthaltenden Gebiet das Vor- 
zeichen wechselt. Der giinstigste Wert A, welchen man mit einer optimalen 


Funktion /(x, y) und dem zuletzt genannten Ansatz fiir o(x) bekommen kann, 
ist A =4a, mit dem dann o (x) £0,003 83 ist. 

3.2. Eine Klasse partieller Differentialglerchungen. Wir nennen nun eine Klasse 
von Randwertaufgaben bei partiellen Differentialgleichungen, fiir welche die ge- 
forderten Voraussetzungen gelten. 

Es sei 8 ein offenes, beschranktes, einfach zusammenhangendes Gebiet der 
X1, X9-Ebene (kurz: x-Ebene) mit stetig gekriimmtem Rand I’: x,=«(s), x,=£(s) 
(s =Bogenlange; «’’(s), B’’(s) stetig [7]). Gesucht ist eine auf B=S4T stetige, 
auf 8 mit stetigen ersten und zweiten partiellen Ableitungen versehene Funk- 
tion u(x) =u(x,, %)), welche auf B die Differentialgleichung 


(M{u] =) — Aw = f(x, u) (3.4) 
erfiillt und auf I’ gegebene Werte 
u=y(s) (3.5) 


annimmt, wobei y(s) eine stetig differenzierbare Funktion bedeute und f(x, y) = 


/(*,, %2; vy) die in Nr. 1.1 unter c) genannten Eigenschaften besitze. Die Itera- 
tionsvorschrift lautet 


—Atijs1 =] (%,4,) aut B, wo == y(s)> aut ft (i= 0,423 ee 


Aus den in [5] und [7] dargestellten Ergebnissen kann man schlieBen, daB 
eine stetige Funktion g(x) und eine nichtnegative Greensche Funktion G(x, &) 
existieren, welche die in Nr. 2.1 genannten Voraussetzungen erfiillen. Ausfiihr- 
licher ist dies in [4] dargestellt, wo Probleme der Art (3.4), (3.5) in anderer Weise 
behandelt werden. 


Die Vergleichsaufgabe lautet hier 


—Ao=f(%,o+0,+ @) —f(x, 09) auf 8, oZo auf Tr. 


Man braucht nach Zusatz 2.3 auf I” nur 00 statt 6 =O zu fordern, da die L6- 
sung z der Aufgabe 


— Au = 0jauteS. weoananias 


ftir 20 nach dem hier giiltigen Randmaximumsatz (s. [3]) nichtnegativ ist. 
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Die Funktion ~, (x) ist Lésung einer Aufgabe 
—Auv=r(xje-aut GS, =u =yls) auf I" 


mit bekannter stetiger Funktion 7(«). Kennt man eine Funktion #,(x) welche 
die fiir #,(x) geforderte Differentialgleichung erfiillt, nicht jedoch die Randbedin- 
gung, so kann man auch den Fehler | u*(«) —#,(x)| abschatzen, denn nach dem 
Randmaximumsatz gilt 

| u(x) —%4(x)|S@ mit wo= Max |%, — y(s)|, 
also 


|,(x) — 49(x)| Soy(x) mit [a (x) — w(x) +oSa(x) 3.7) 
und 


| u*(x) — a, (x)| S| u*(x) — uy (x)| +o So(x) +0, 
wobei o(x) mit der Schranke o,(x) aus (3.7) zu berechnen ist. 
3.3. Beispiel einer partiellen Differentialgleichung. 
Betspiel 2 ist eine Aufgabe der in Nr. 3.2 behandelten Art: 
—Au=atxtew fir o<r<i, 
“u=0 fin aa, (7 == 2 Xe) 
mit einer positiven Konstanten «. 


Von 
Wy = all?) 


ausgehend berechnet man — *,=7 cos m setzend — 


2 
Uy = (1—7") \< as (13 1372 2374 Nl 975) a ree (97? 4474 b4r8)h 


Fordert man (0, 0) =%9(0, 0), so ergibt sich (s. Tabelle 3, Spalte 2) 


576 13.0 ee 1452 
dia) a —|/' — 32) |SITE fo Asses ae 


13 
und 
1 1 Bcd 6 
| u; — up| < (1 — 7) aes (3 2374 + or8| +1 |r — 1478 + 47 exe 
mit 
o=7(1—r)ad und d= LEST Salaam 


41529" 430. 15760 


Bei der Fehlerabschatzung verwenden wir nun verschiedenartige Majoranten. 


a) Im Gebiet 
Ge y0=754;. “coc co 


gilt fiir f(x,y) =a +27 y?: 
[fy (, to 9)| = 22t| a0 + y| S27? (uo +|91) = 1 (2, 


y|) 
mit = . 
((o.5) = {Gy a 27 Mo VT”, 
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welche Funktion man daher nach Hilfssatz 1 fiir w =, als Majorante verwenden 
kann. Die Vergleichsaufgabe lautet damit bei @9=0: 


=Aco—(2a7(t— 7) 4 27dAt— PY) e726" 
> 2a8d7lt — 7)? 4 at dtr 7) te Osere 1, 
o=0 tir 7=1. 
Der Ansatz o=A(1—/r4) (A =const) fiihrt auf die Ungleichung 
ANNO = 2417?) A 74) ae 2aed ere g*)\— A 72 
22847 (1—7PPe@PAnt—rFryP fie 07 =4- 


Mit 


(ier) eo) STA ee) ee 
(=A/s1, 7U—7)? <= 35, (4 — 7*)2 

erhalt man die quadratische Gleichung 
A?— A[16—a(2+4ad)]+@(ad4+ 7a a) =0 


als hinreichende Bedingung fiir A. Eine nichtnegative Lésung A(«) existiert fiir 
AS%qX 21,8 (Tabelle 3, Spalte 3). Nach Satz 2 besitzt also fir solche Werte « 
auch die gegebene Aufgabe eine Loésung w* und es gilt die Fehlerabschatzung 


| w*(x1, %_) — MW, (%, %)| S Ala) (4—74) fiir a Saye 21,8. 


Tabelle 3 (Beispiel 2) 


ot 4,(0,0)=a(a) _ | (l¥*(0, deol) | (le*(0, a0, 0)|S) 
2 0,502852 0,000123. | 0,000126 
4 1,011 55 0,001 08 | 0,001.43 
6 1,52629 0,004 05 | 0,004 37 
8 2,047 3 0,0107 0,0120 
10 2,5748 0,023 7 | 0,0284 
12 3,109 14 0,047 1 0,0601 
14 3,650 0,088 ly O27 

16 4,199 0,159 0,313 

18 4,755 0,289 | = 

20 5,319 0,578 a 
21,8 5,83 1,60 — 


b) Wir benutzen nun eine lineare Majorante und schranken © ein auf 
OF OsS7 4. 0S yee —7?) 
mit noch zu wahlender Konstanten c. In diesem Gebiet ® ist 


| 7, (%, y)| 2677 (1 Sar)s 


10 Schatzt man hier genauer ab, bekommt man nur wenig bessere Ergebnisse. 
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so daB man 


Mee 2077 (les 7 ey (3.8) 
und das Vergleichsproblem 


—Ao—2¢7(1—r\oZ2acdrAi—7)? fir OXr<1, 
o{0) =0 AU nly 2 | 
verwenden kann. Der Ansatz o=B(1—7r4) (B=const) fithrt auf 


B[16 — 2¢(1 — 7) (4 — #4) ] S 2a®%cdr(1 — r)2. 


Wegen 
(ee A eh ete (Vee) (3.9) 
genugt 
eo 20° cd- 
— 16—2¢ 


fiir c<8 dieser Ungleichung. 


c muB so bestimmt werden, daB die Bedingung (1.17) erfiillt ist. Dies fiihrt 
auf die Forderung 


BYA— 1) <m(x)<ct —7) — BU — 1). (3.10) 


Wir schatzen u, in entsprechender Weise ab wie die unter a) aufgetretenen 
Funktionen: 
WS +%HS Cdr(41—7) +a(1 —7*) 


und bestimmen c aus der fiir die zweite der Ungleichungen (3.10) hinreichenden 
Bedingung 


Bdr?(1—7) -ai—r)sct— 7) — Bi —7) 
zu 
c=@d+tat+2B. 
Setzt man diesen Wert in (3.9) ein, so ergibt sich fiir 6 die quadratische Gleichung 


B?— Bl4—a(ge + fad)|+@(f-ad+ 3,0) =0, 


welche fiir 
aS aH, & 16,6 


eine nichtnegative Lésung besitzt. 


Bei unmittelbarer Anwendung des Satzes 2 muB man noch die erste der 
Ungleichungen (3.10) nachpriifen. Darauf kann man nach Zusatz 1.1 jedoch 
verzichten. Wegen g(x) =0, G(x, €)20 und f(x, y)20 gilt die erste der Un- 
gleichungen (1.21) namlich mit ®(x) =0, und fiir diese Funktion @(x) ist 


Max | ®(x), , (x) — o(x)] 20 =¢—(x). 
Man erhalt damit die Fehlerabschatzung (s. Tabelle 3, Spalte 4) 


| w* (x1, %2) — 4 (%, %)| SB(«)(1—7') fir aSa, 16,6. 
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Bei dieser Aufgabe konnten wir durch Mitfiihren eines Parameters c ein 
nahezu optimales Gebiet © der angesetzten Form bekommen, brauchten also 
nicht, wie in Nr. 2.3 beschrieben, zu probieren. Trotzdem erscheint der Rechen- 


gang unter a) einfacher. 


§ 4. Beispiele fiir Anfangswertaufgaben 


4.1. Gewéhnliche Differentialgleichungen. Wir behandeln Aufgaben der Gestalt 


(M[w] =) «'— p(x) u = f(x, 4) 


(4.1) 
u(O) =a 
und fragen nach (stetig differenzierbaren) Lésungen im Intervall 6 =[0, «]. 
p(x) sei stetig und f(x, y) erfiille die in Nr. 1.1 unter c) genannten Voraus- 
setzungen. 
Der gegebenen Aufgabe dquivalent ist die Integralgleichung (2.6) mit den 


Funktionen 
x 


eo J P(n)an 
[plé) dé 


es fir 6 
PACA We itn und ona =|" 


fir £22, 


welche die in Nr. 2.1 verlangten Eigenschaften besitzen. Die Vergleichsaufgabe 
lautet 


~ 


a — p(x)o =i (x,¢ +0,+ 00) —f(%, 09) fiir O<x<a, 
O 


o(0) = 


Wir rechnen ein einfaches Beispiel. 


Beispiel 3: 
“=x fir OSfe4S0, 4(0) =4. (4.2) 


Die Aufgabe soll mit dem bei w=, angesetzten vereinfachten Newtonschen 
Verfahren gelést werden. Diesem Verfahren entspricht die Umformung der Dif- 
ferentialgleichung in die Gestalt 


(M[u] =) u’ — 2ugu =x? + [(u — wu)? — v2] (== (45%). 
Die Funktion h(x, w) auf der rechten Seite der Differentialgleichung in (4.2) 


ist von der speziellen Form (2.18). Wir benutzen das fiir diesen Fall am SchluB 
der Nr. 2.4 beschriebene Rechenverfahren, haben also u, (x) so zu bestimmen, daB 


é(x) = —m +22+ 42 


im Intervall [0, «] nichtnegativ und méglichst klein ausfallt, und erhalten dann 
die Fehlerschranke o (x) aus (2.20): 


o'—2u(x)o—@Ze(x) fir OSx<Sa, (0) =0. (4.3) 
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Als sehr einfache Naherungsfunktion u(x) verwenden wir zur Erlauterung 
der Methode zunachst den Abschnitt 


wW=1+%+ 2 (Tabelle 4, Spalte 2) 
der Potenzreihenentwicklung der exakten Lésung u*(x). Dann ergibt sich 


é(%) = 4227+ 2x3 + x4, 


und der Ansatz o =C x* (C =const) zur Lésung von (4.3) fiihrt auf die Forderung 


Q(x, C) =C?«4—C[3 —2(x +22 +%3)]+44+2x%4+250 fir 0Sx<a, 
d.h. O(a, C) <0. Wir setzen Q (a, C) =0. Fiir «<a, 0,5095 existiert eine nicht- 
negative Lésung C(«) dieser Gleichung. 
Fir jedes solche « und *<« erhalt man also die Fehlerschranke o = x? - C(«). 

Indem man x =« verwendet, ergibt sich die Abschatzung 

|u*(x) — u,(x)|Sx8C(x) fir OS % Sa.x0,5095 (Tabelle 4, Spalte 3). 
Die Potenzreihenentwicklung der Schranke x*C(x) nach x beginnt mit $23, 
dem nachsten Glied in der Taylor-Reihe der Lésung. Theoretisch kann man 
hier — wie auch bei der Funktion (4.4) — eine beliebig genaue Schranke erhalten, 
denn fiir o=u*—u, — es ist u,Su* — gilt in der Differentialungleichung von 
(4.3) das Gleichheitszeichen. 


Tabelle 4 (Beispiel 3) 


| 3 
x metpate | (lut(z)—m(ols) ects) | = 24+ (u(x) —2(x)|<) x*D(x) 
| a 
O 0,0 0,0 0,0 0,0 
O,1 bale | 0,001 52 1,111444 0,000 020 
OR? 1,24 0,0142 1,252667 0,000 381 
0,3 1,39 0,0589 1,437 57 0,002 46 
0,4 1,56 0,189 1,6880 0,0108 
0,5 175 0,75 2,041 7 0,043 1 
0,6 1,96 = 2/2: 0,294 
Benutzt man die Funktion 
a 4 3 
i; =~ +7 _ (Tabelle 4, Spalte 4), (4.4) 
1 3 


eine Summe von Lésungen der Differentialgleichungen 


WWE nel OY See 


so bekommt man mit dem Ansatz o =D x* (D =const) in entsprechender Weise 
die Fehlerabschatzung 


| u*(x) — uy (x)| S x*- D(x) 
Arch, Rational Mech. Anal., Vol. 2 


fir 02% Sag 056110 


(Tabelle 4, Spalte 5). 
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Man beachte, daB u*= “s ist (= lést die Anfangswertaufgabe u’=w?, 
u (0) =1), so daB die Lésung u*(x) eine Singularitat fiir einen Wert x =%)S1 | 
besitzt, tiber den man mit der Fehlerabschaétzung nicht hinauskommen kann. 
4.2. Eine Klasse partieller Differentialgleichungen. Es sei 8 das aus den 
Punkten 7%, v5 wit 
4S Xb, hia a=aa Ws) 


bestehende Gebiet der x,, %,-Ebene. Dabei bedeute /(x,) eine auf einem Inter- 
vall aX<x,<b erklirte Funktion mit stetiger Ableitung h’(x,)>0. 
Gesucht ist eine auf 8 stetige und mit stetigen Ableitungen uw, , U,,, Uz, x, VeT- 
sehene Funktion u(x) =u(x,, %2), welche die Gleichungen 
(M[u] =) —4,,,,=1(%,%,4) auf 8, 
(U, [w] 23) U(%, Xa) = 1 (%), (U; [] =) Uy(%,%) =Yo(%) fiir x = h(x) 


erfiillt, wobei die y;(x,) gegebene stetige Funktionen bedeuten und f(x, y) = 
f(%, %, vy) die in Nr. 1.1 geforderten Eigenschaften besitze. 


Die gegebene Aufgabe ist der Integralgleichung 


%, h(E) 


uly ma) = (Hla) +f yelG d+ SS HE ba, lbs, $e) dads, 


aquivalent, wenn H die Umkehrfunktion von h bedeutet. Diese Integralglei- 
chung 1aBt sich in der Form (2.6) mit 


oe 


(x) 


fy i (ge ee ee 
G08) =Cla mi 8) =| (%) S454, SoS h(%) 


0 sonst auf $ 


schreiben. Die Funktionen g(x) und G(x, &) haben die in Nr. 2.1 geforderten 
Eigenschaften. 


Die Vergleichsaufgabe lautet hier 
074, = 1%, Haro oo, 06) 1 ee Bo) auf %, 
G(%,%_) 20, Oy, (%1,%)) 20 fir 4 =h eae 


Nach Zusatz 2.3 braucht man nur ¢=0, o,,=0 zu fordern, da die Aufgabe 


=—U,,—=0 aur 6; u=0,  u,=0, fir #,—A(x,) 
in diesem Falle eine nichtnegative Lésung z besitzt. 
4.3. Beispiel einer partiellen Differentialgleichung. 
Als Beispiel 4 rechnen wir die Aufgabe 
thy gy = 1 a Oe) TO Ss apne too eee %y) 


u=u,=0 fir x,=% 
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mit einer positiven Konstanten «. Um ein Problem der in Nr. 4.2 behandelten 
Art zu bekommen, schranken wir den Bereich, in welchem die Differential- 
gleichung erfiillt sein soll, zunachst ein auf 


Ue 0, eae ee, I OO =O (4.5) 
Der Ansatz u,)=0 liefert 
Uy = G(X — % 2)? (2%, + %), 


Uy = Uy (x1 — %2)* (5 x1 + 4% %_ + 29). 


Wir schatzen den Fehler der Funktion wu, ab, indem wir im Satz 2 ™ statt m9, 
Uz statt u, benutzen. Als Majorante laBt sich wegen /,=«x, im Gebiet 


Ge te eS, = Oy = 60 
die Funktion 


1(%,¥) =f (%4, %3 Y) =any 


verwenden. Mit o,=u,—u, lautet dann das Vergleichsproblem 


4 


5, (ty X,)* (54? 4-44, %, 1-42) aut GB, 


= —= = 
Ox, x» 4X0 = 240 


o=o0,,=0 fiir *,=%. 
Der einfache Ansatz o =c(x.— x,)® (c=const) filhrt auf die Forderung 


© (30 — a x (x4 — ¥9)*) SA ay (5.08 + 4m xy + 28) auf %. 
Im Bereich (4.5) ist 

My (%y — Hy)? O(b—a)?, 4, (5 42 +40 Hg + 23) S 1083. 
Gilt «b(b —a)?<}30, so existiert also eine nichtnegative Lésung 


a2 58 


24 (30—ab(b—a)?) ~ 


Da man, um fiir eine beliebige feste Stelle x,, x, mit O< x,< 4%, eine Schranke 
zu bekommen, diese als Eckpunkt 0, a des verwendeten Dreiecks 8 auffassen 
kann, erhalt man die Fehlerabschatzung 

a HF (4% — %2)8 
24 (30—« *, (4, —%)?) 


| w* (1, Xp) — Me (%1, %2)|S 
fiir alle x,, x. mit OS %,S% und a %,(% — %2)?<30. 


Nachtrag wihvend dey Korrektur. Bei den hier mitgeteilten Fehlerabschatzungen 
fiir Differentialgleichungen werden zwei gema8 (2.7) verkniipfte Funktionen u, (7%) 
und w,(¥) benutzt. Aus diesen Ergebnissen lassen sich im Falle G(¥, ) 20 Abschatzun- 
gen folgern, fiir die man nur die Ausgangsnaéherung w(x) bendtigt, welche die ge- 
gebenen Randbedingungen nicht zu erfiillen braucht. Statt u,(¥) —u9(¥) werden 
dabei der Defekt d= —M([u,]+/(%, uw) und gewisse ,,RandgroBen verwendet. 
Dariiber soll in Kiirze noch etwas ausfiihrlicher berichtet werden. 
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